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1.0 EXECUTIVE SUMMARY 


This report documents the results from the research performed under the contract F49629-01- 
C-0027 entitled “Robust Adaptive Control of UCAVs”. Our research was aimed at developing 
robust nonlinear adaptive control algorithms for UCAV flight control, flexible mode 
compensation and stabilization, and electromagnetic actuators. Specific techniques investigated 
were Internal-Model-Principle-Based Nonlinear Output Regulation and the Adaptive Inverse 
Method. For comparison purposes, Neural Networks was also investigated. 


11 Robust Adaptive Nonlinear Output Regulation 


In order to develop nonlinear adaptive control for UCAVs, Internal Model Principle 
Nonlinear Output Regulation (IMPNOR) technique was investigated. This controller design 
technique was applied to structural mode filter design, and UCAV adaptive flight control, and 
the performance was evaluated for implementation. 


For the structural mode filter design, the classical notch filter is often used in practice. The 
draw back to this approach is that the frequency to be suppressed needs to be known exactly, 
otherwise the filter would miss the target frequency and fail to suppress it. In addition, once the 
filter has been designed, the target frequency would be fixed and cannot accommodate any other 
frequencies. As will be discussed in the next section, the frequency that needs to be attenuated 
could change depending on maneuvers the aircraft performs, particularly for flexible aircraft 
such as UCAVs. Hence, the goal was to develop an adaptive method to dynamically filter out 
the undesirable effects of the structural modes on the flight control sensors. As will be discussed 
in Section 3, the IMPNOR method was employed to design a control law that tracks the 
reference command for take-off and landing while suppressing the undesirable harmonic signals 
whose frequencies were not known a priori. The simulation results show that simultaneous 
command tracking and disturbance rejection were achieved successfully by the IMPNOR 
method. 


For the UCAV adaptive flight control, we applied the IMPNOR method to the X-45A linear 
model and designed flight control laws which accommodate surface failures and nonlinear 
modeling errors. Both inner-loop and outer-loop responses were evaluated with and without 
surface failures. The IMPNOR method tracked the reference command successfully in spite of 
the surface failures. To assess the performance of the resulting IMPNOR control laws, an 
alternative method using Neural Networks (NN) was applied to the same linear model and the 
results were compared. Both IMPNOR and NN controllers seemed to track the command well. 
The details are discussed in Section 4. 


1.2 Adaptive Control of Electromagnetic Actuators 


The Adaptive Inverse Control method developed by G. Tao and P. Kokotovic was 
investigated to design a controller for an electromagnetic actuator (EMA). Specifically, we 
applied the method to compensate the nonlinear dynamics that appear in the Miniaturized 
Munition Technology (MMT) actuator subsystems known as dead-zone, Coulomb friction, and 
quantization errors. The simulation results showed the Adaptation Inverse Control method 
performed better than a detuned fixed inverse controller or a controller with no inverse model 
associated with it. However, the Adaptive Inverse Control method also introduced chattering, 
slow adaptation process, and less robustness to larger magnitude of initial parameter estimation 
errors. To alleviate these problems we considered the IMPNOR technique as an alternative 
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method, and applied it to MMT actuator subsystems. The simulation results showed that 
IMPNOR resolved the chattering and robustness issues while tracking the command in the 
presence of harmonic disturbances. The details are discussed in Section 5. 


1.3 Aircraft Departure Analysis/Prediction Tool 


We investigated a robustness analysis method appeared in [10] in an attempt to develop an 
Aircraft Departure Analysis/Prediction Tool. We also visited Dr. R. Murray and Dr. J. Doyle at 
California Institute of Technology, who are two of the co-authors of [10] to discuss the method. 
After the discussion, and further investigation, we decided that the method is not mature enough 
at the present time for technology transition, and that we will not proceed any further on this 
task. 


1.4 Research Collaboration with Universities 


Throughout the entire contract period of performance for this research program, Boeing and 
Washington University (Dr. C. Byrnes and Dr. A. Ishidori) had a close working relationship and 
shared the interest in applying the Internal Model Principle Nonlinear Output Regulation Control 
technique to real aircraft system in an effort to transition the technology emanating from this 
research program. The research collaboration between Boeing and Washington University 
culminated in publication of [11]. 


We also had a dialogue for potential research collaboration with Dr. Murray at California 
Institute of Technology (CalTech) in the area of robustness analysis techniques. Boeing and 
CalTech had a technical coordination meeting where both parties expressed interest in 
collaboration, and decided that we keep searching for research common ground by keeping an 
open dialogue. 


Boeing also had research collaboration with Dr. Hovakimyan at Virginia Polytechnic 
Institute and State University in the area of Air Collision Avoidance problem from the Control 
Theory point of view. This research collaboration was culminated in publication of [12]. 
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2.0 INTRODUCTION 


Affordability and safety have become the prime focus for today's defense research. New 
Unmanned Combat Air Vehicles (UCAVs) are being developed to significantly reduce the cost 
compared to existing manned systems. Smart GPS guided weapons are being developed to 
improve mission effectiveness and reduce costs per target kill. To answer to these challenging 
demands and/or needs, our research was centered around UCAVs and advanced munitions such 
as Joint Direct Attack Munitions (JDAM) and the Small Smart Bomb (250 Ib weapon under 
development), and the efforts were focused on the development of new control algorithms that 
would improve flight safety, mission reliability, and cost reduction for UCAVs and these 
advanced weapon systems. 


The specific areas of primary investigation under this AFOSR program were: 
e Robust adaptive control algorithms for flight control 
— Robust adaptive control laws 
- Adaptive aeroservoelastic compensation for flexible aircraft 
e Adaptive control compensation for electric actuators 
e Aircraft departure analysis and prediction tools 


These investigations were performed using new theory on robust output regulation for 
nonlinear systems being developed at Washington University in St. Louis (Byrnes and Isidori), 
robust adaptive control methods developed at UC Santa Barbara (Kokotovic), and robustness 
analysis of nonlinear systems developed at California Institute of Technology (Murray). 


Robust adaptive output regulation [1]-[5] offers a potential to enhance flight performance and 
improve robustness to uncertain aerodynamics and structural dynamics. Our research 
investigated this new approach [5], and compared it with the adaptive neural networks approach 
using an X-45A linearized model. One of our goals was to demonstrate this technology in a 
flight control application, and to extend it to design an adaptive aeroservoelastic compensator for 
flexible aircraft, which would fulfill a critical need currently not present in industry. 


Our research on adaptive control compensation for electric actuators is directly applicable to 
both ΠΌΑΝ and JDAM. UCAV is using electric actuators that are a derivative of the JODAM 
design. Our research in this area would significantly reduce the cost of these actuators. Each 
UCAV uses 18 actuators, and each JDAM uses 3 actuators. Our success would lead to reduced’ 
parts count, reduced weight, and reduced cost for each actuator. Multiplying by the production 
numbers for JDAM, and the potential savings from UCAV and the other munitions programs 
that are under development, the potential savings are enormous. 


In addition to being affordable, UCAV must be reliable and safe, and must have the ability to 
integrate with manned aircraft operations. The missions to be performed by UCAV are 
highlighted in Figure 2-1. 


Robust adaptive control and self-diagnostics, prognostics, and health monitoring 
technologies are needed to give UCAV the same reliability and safety of manned aircraft. 
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Figure 2-1. UCAV Suppression of Air Enemy Defense Mission Scenarios 


SAM Site 


Autonomous ground operations, robust flight control, and multi-vehicle coordinated 
operations are being designed for UCAV to minimize operator workload, increase mission 
reliability, and reduce support costs. These new levels of vehicle autonomy introduce new 
requirements on the aircraft’s control system. Sophisticated contingency management 
algorithms must replace the pilot's ability to adapt to unforeseen and unpredictable events to 
guarantee flight safety and mission reliability. The aircraft’s control system must be able to 
respond to these contingencies. Figure 2-2 illustrates the guidance and control modes for 
UCAV, as well as several of the contingency modes. The figure illustrates autonomous ground 
operations (auto taxi and takeoff), ingress to target area, mission execution, and safe return to 
base. 
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; «Engine on ¢Pre-Takeoff test *Cruise ¢Holding pattern 
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modes *Pre/Post * Takeoff roll *Flare 
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Figure 2-2. UCAV Guidance, Control, and Contingency Modes 
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Robust adaptive control plays a critical role in the autonomous control of UCAVs. These 
algorithms must accommodate unknown or uncertain aerodynamics, flight control actuator 
failures and/or battle damage. Our research addresses the need for flight control and for adaptive 
aeroservoelastic compensation. The need for reconfigurable flight control is well understood, 
and alternate solutions to this problem exist. The need for adaptive aeroservoelastic 
compensation for flexible aircraft remains unanswered. 


The X-36 Agility Research Aircraft shown in Figure 2-3 is a 28% scale test aircraft used to 
demonstrate agility and handling qualities in a tailless fighter (unstable in multiple axes). The 
X-36 is all composite aircraft that proved to have structural modes that depend upon flight 
condition. Due to both the aerodynamic loading on the fuselage and wing and the control 
surface deflection, the structural modes varied in frequency and in damping. Designing 
compensation to attenuate the effects of the structural modes on the flight control sensors (gyros 
and accelerometers) proved to be costly in time and manpower, and introduced risk into the 
program. The resulting compensation used filters that were scheduled with flight condition, 
including control surface deflection. 


steak ἘΠ , 


wird tent 


GP44275003.ppt 


Figure 2-3. X-36 Agility Research Aircraft 


Key in making the Boeing UCAV design affordable has been the incorporation of all-electric 
aircraft technologies. Existing aircraft use hydraulic actuation systems that are expensive, heavy, 
and require significant maintenance. The Boeing UCAV design currently has 18 electro- 
magnetic actuators (EMAs). Six electric actuators are used to control UCAV's flight control 
surfaces, as shown in Figure 2-4, with 2 actuators used to control the thrust vectoring system, 
and additional 10 actuators to control the nose and main landing gear, gear doors, steering 
system, and weapon bay doors. 


UCAV has baselined identical actuators for all 18 EMAs. This in itself (using the same 
actuator everywhere on the aircraft) greatly reduces aircraft costs and logistics footprint. These 
UCAV electric actuators are manufactured by HR Textron under subcontract to Boeing. The HR 
Textron UCAV design is a derivative of the Joint Direct Attack Munition (JDAM) actuator 
design, and uses common parts and manufacturing processes and assembly. The large 
production rate from the JDAM program makes the HR Textron UCAV actuator very attractive 
from an affordability standpoint. 
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Figure 2-4. UCAV Flight Control Surfaces and Actuators 


The UCAV actuator design, shown in Figure 2-5, is an electromechanical design, in which 
two brushless DC motors on a common shaft drive a ball screw to convert rotary motion into 
linear motion. When assembling the ball screw in the actuator unit, small ball bearings are 
placed in between the ball screw and the ball nut. After the actuator is assembled the backlash is 
measured. If the system is too sloppy, the unit is disassembled and slightly larger balls are 
inserted to remove the slop. This is labor intensive and expensive. We would like to assemble 
the unit once and for all, and accept whatever the tolerances are there. Allowing larger 
tolerances in the machining of the ball screw could further reduce actuator costs. Relaxing these 
tolerances will lead to a system that has more backlash, however. 


Thermal expansion effects also complicate the design of the actuator. If lower cost materials 
could be used (that have greater thermal expansion), then costs could be further reduced. These 
materials are also easier to machine, reducing tooling costs. 


The two-DC-motor design shown in Figure 2-5 exhibits the following characteristics: 
e Stall load: 2,500 Ib 

e Max load prior to back drive: 3,500 Ib 

e Max brake hold capability: 5,000 lb (unpowered) 

e 4” and 6” total stroke configurations 

ὁ 6 in/sec no load rate, 4.5 in/sec at 1,500 lb 

e 250 flight hour actuator life 
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Figure 2-5. UCAV Electric Actuator 


e 28 VDC and 135 VDC 


e Peak current: 12 amps per motor (24 amps per actuator) 


e Estimated weight: 10 Ib 

ὁ One Brake, dual coil (Reliability if one fails) 

ὁ Two DC motors torque summed on common shaft 
e Gear reduction to ball screw providing linear output 


This type of EMA actuator exhibits different types of nonlinearity such as dead-zone, 
backlash, and Coulomb/Viscous friction. Our research focused on adaptive compensation to 
remove/negate these nonlinear effects, and produced an actuator controller that has repeatable 
performance from unit to unit. 


Finally, this research program under AFOSR contract addresses fundamental scientific issues 
with a goal of rapid transition to real applications. The combination of “technology push” by 
basic research and “technology pull” by the applied programs provides specific technology 
transition objectives and paths. This work is directly linked to programs at The Boeing 
Company and US Air Force. 
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3.0 ROBUST ADAPTIVE NONLINEAR OUTPUT REGULATION 
- An Application To Structural Mode Filter - 


In this section, we discuss the Internal Model Principle Nonlinear Output Regulation 
(IMPNOR) method that suppresses the effect of harmonic disturbances on the regulated output 
of a system to be controlled. Controllers which are optimal in an LQ senses are derived for the 
suppression of harmonic disturbances for discrete-time, finite-dimensional, multivariable linear 
systems. These controllers are shown to enjoy some of the robustness features concomitant with 
an LQ optimization scheme. 


This work can be applied in a variety of settings, for example in active noise control which 
was studied in [9] for scalar-input scalar-output systems using a minimum variance performance 
measure. The suppression of harmonic disturbances also arises in the active control of vibrations 
in helicopters[6], where a continuous-time model is used in the context of LQ control and 
Kalman filtering. 


Our interest in suppression of harmonic disturbances arose in the development of feedback 
control strategies for next generation aircraft in particular, for certain aircraft it is known that the 
wings exhibit slightly damped flexible behavior which, when excited by a wind gust, produce a 
disturbance which additively corrupts some important velocity measurements. The frequencies 
of the slightly damped (exogeneous) signal generator are known, but the wind gust may be 
modeled as a dirac delta function which forces the exogeneous system into an unknown initial 
state. The control objective is to track a prescribed trajectory while suppressing the disturbance 
produced by the exogenous system. This is a slight modification of the standard problem of 
output regulation, in which the reference trajectory itself is also assumed to be generated by an 
exosystem. As we shall show, this modification is easily accommodated within the present 
framework of output regulation. However, since output regulation ensures some asymptotically 
stability of the error while the tracking and disturbance rejection problem requires a finite time 
horizon in practice, we have assumed the exosystem is undamped and consists of a finite bank of 
harmonic oscillators. 


As part of an ongoing research effort, we are developing a solution to the problem for a 
nonlinear system which incorporates both the rigid body dynamics and aeroelastic dynamics. In 
this section, we illustrate our use of the internal model principle[8] to solve this problem for 
continuous time linear systems. Specifically, in Section 3.2 we introduce the basics of internal 
model based design for a scalar-input, scalar-output system. In Section 3.3, we illustrate this 
design for two particular systems where the desired trajectory idealizes a take-off and landing 
maneuver. In Section 3.4 we give some brief remarks about the internal model principle. One 
remarkable feature of this principle is its ability to systematically produce classical design 
approaches in a variety of particular control problems. In the case at hand, it produces a notch 
filter for the transfer function from the disturbance to the tracking error, with notches, or 
(blocking) zeroes, at the natural frequencies of the exogenous system. Of course, the real power 
of this method is its ability to produce nonclassical, multivariable designs. Indeed, in Section 3.4 
we consider three-input, three-output, minimum phase systems with vector relative degree (2; 2; 
2). Interestingly, the internal model based controller design leads to a Linear Matrix Inequlaity 
(LMI) constraint on the design parameters, yielding a convex problem which 15 easily solved. 
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3.1 The Case of a SISO System 


Consider a linear single-input single-output system having relative degree two and 
asymptotically stable zero dynamics. Any system of this kind can always be put, after a suitable 
change of coordinates in the state space, in the form 


% = Ag+ Bir,4+ Bory 

Zi = 22 

fo = Cze4+Di21+ Dota+ Ku. | 
eS 21 (3.1) 


in which K # 0 and the matrix A is Hurwitz. 


It is well-known that single-input single-output systems having relative degree two and stable 
zero dynamics can always be asymptotically stabilized by means of a proportional-derivative 
output feedback, i.e. by means of a control law of the form 


u=kiy + ky. 


It is also well-known that a proportional-derivative output feedback can be used to solve the 
problem of having the output y(t) to track a prescribed trajectory y,.?), if the latter and its first 
and second derivatives are available in real-time. 


In this section, we describe how the problem of tracking a prescribed trajectory Vrer (4) can be 
solved, for a system of the form (3.1), in case the information about the output y (1.6. x;) 1s noise- 
free, but the information about the rate of change y (i.e. x2) is corrupted by harmonic noise, 1.6. if 
only x, and the quantity 


£2 noisy — 22 +d ᾿ (3.2) 


where d is a superposition of sinuosoidal functions of time that are available for feedback. 


Define 


Ci Y—Uref = Zi Yref 
€&9 = Y-Vref = 170 τ Vref 


in which e; is indeed the tracking error and 6), its rate of change. Then, system (3.1) can be 
rewritten as 


a = Age By(ey ἢ Yref) a Bo(e2 a Vref) 
ὃ] = & 
ὃ = Cz+D (ει ἘΝ Yref) Te Dy(e τ Yret) + KU — Gree . (3.3) 


Let Ζιος be any solution of 


Zref = A2ret + Biyret + Bo ret (3.4) 


and define 


Cz = 2 — ref | 
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Then system (3.3) can be rewritten as 


ἃ; = Aez+ Bye, + Boe 
δι = 82 
ἐλ = Cezt Carat Diles + veer)+ Daler t+ tret) + Κα -- veer. 


(3.5) 
Choose now a control of the form 
tu = U(21, 2a noisy; Vref, Yref, Vref, Zref) | 
= ἀ{-Οχρε — (Di + At ret — Dates + Gre — Di (21 — Yret) — ka(Za,noisy — Yret) + υ] | 


in which v is an additional input to be determined later. With this control u, system (3.5) 
becomes 


ὁ, = Ae, + Bye, + Boer 

ej = €9 _ 

ἐλ = Ce, — χει -- χθ) -- χα τεν, (3.6) 
where 


It is known that d(t), which is superposition of sinusoidal functions, can be put in the form 
d(t) = ¥T w(t)! 


with w(t) solution of the homogeneous equation 


th(t) = Sw(t) (3.7) 
in which S is a matrix of the form 
Sy ὁ - ὃν 
ge 0 S% : | 
0 0 Sef | 


and 


so (2, 8) 


Under our assumption that the zero dynamics is either vacuous or is asymptotically stable, 
we know that any transmission zeros of the plant to be controlled lie in the left half plane. Since 
the eigenvalues of the exosystem (3.7) that generate the disturbance lie on the imaginary axis, we 
see that the standard nonresonance conditions for solution of the regular problem are satisfied. 


Matters being so, the output regulation problem for y,.¢= Ὁ 15 solvable, when the additional 
input v is chosen as the output of an internal model of the form 


é = St-—Che 
v = Ure 


! 
| 
| 
| 
| 
| 
| 
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where G is a vector of free design parameters. We claim that this also yields a solution for 
nontrivial y,. Indeed, introduction of v yields a closed loop system which, changing the 
variable ἕξ into 


x= € — kow 
is described by set of equations 


ἐν = Aez+ Brey + Boe 
i ale, 


6 
é = Ce, + UP x -- χει -- χὰ. 
2 x 161 252 (3.8) 
At this point, it remains to show that, if k;, k2 (or, what is the same, Κ: ), and G is 
appropriately chosen, system (3.8) is asymptotically stable. If this is the case, in fact, we have in 
particular 


Jim ex(t)=0, 
which is the required tracking goal, regardless of the fact that the measurement of y is corrupted 
by noise. 


To provide our stability result, set 
ura (UP WP. φῇ 


and 


P= OF --. of) 


where the partitions indicated are consistent with those of S. 
Then, the following result holds. 
Proposition 3.1: Consider system (3.8). Choose G;,..., G, in such a way that 
Tol =e 
Vi SG; >0, A ee (3.9) 
SN ΣΟ, <1 
at τος (3.10) 
which is always possible. If k; > 0, k2> Oare sufficiently large, the system is asymptotically 
stable. 


The proof of this Proposition is a straightforward consequence of the following Lemma, 
proven for instance in [7]. 
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Lemma 3.2: Consider the matrix 


A 0 0 0 BY. 
0 Sy 0 0 Fil. 
a αι 
0 ὁ . 0 0 Pol. 
If 
PQ; «0 Or le =O, acs? 
Se ee (3.12) 
there exists k* such that, if k > k*, the matrix J is Hurwitz. 
To provide the Proposition, we change coordinates in system (3.8) using 
é, = eg +A Bye, 
X = x-hS Ge 
which yields 
é, = Aé,+(A 1B; + By)ey 
Χ = Sxy- k,S—'Ge, 
ej = €9 
é> = Cé,+ vy - (Ο Α-.1.8, τὐἀᾳ ται vs aye, aa ΕΣ : (3 13) 


1.6. a system of the form 


ἃ τὸ Ja 


with J a matrix having the same structure as the matrix J in the Lemma if we set 


B = (A7!B, +B) 
Os = vu ἢ ΞΞ 1͵ sas: 
P, = --αϑὲ σι i=1,...,7 
On: = —(CA*B, + κι - ψ1.9-10}} 
ΓΤ ἘΞ: ἢ 
k = ky 
Thus, conditions ΟΡ; < 0, fori=1,..., 7, of the Lemma become 


UPS", >0 | 
while conditions QoPo < 0 of the Lemma becomes 


CAB, + (1-7 SG) = CAB, + (1- δ UP STG) > 0. 


t=1 


Indeed, if G is such that (3.9) and (3.10) hold, the conditions of the Lemma can be met for 
large k, > Ὁ and, consequently, for large k2 > 0 the matrix J is Hurwitz. 
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3.2 A Numerical Example of Take-Off and Landing 


In this section we consider some specific numerical examples. In our first example we 
consider the case in which there is no z, i.e. no zero dynamics, and that D; = D2 -- 0, Καὶ = 1 so that 
the plant reduces to the double integrator. For the second example we consider a one 
dimensional stable zero dynamics. 


For both examples we assume that disturbance d is given as a sum of sinusoids with two 
different frequencies, 


a(t) = MV, sin (cy t oy ) + Mo sin(ant is ye). 


In this case we can 


and consider 


Ph anweChose a Ue and UF = (UT ve) so that d(t) = UT w(t). 


Example 3.3: Jn our numerical examples we have chosen “1 = % ἋΣ = 10, 
Ψψί - (Ὁ 2, W,=(0 2) 
0 
ως [28 
5 10 
50 


so that 
d(t) = 50 cos(2t) + 100 cos(10¢). 
Next we choose G1, G2, k; and ζω so that the conditions of Proposition 3.1 are satisfied. 


In the numerical] example we have set k; = 10 and 


0.2 1 
σι = os C2 = (4) 


which gives 


UP s7'G,=02, WiSy'G, =02 


and 


UP ΓΟ. πἰ Το; ΤΟ, τ 4 «1 


so the conditions of Proposition 3.1 are fulfilled. 
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We proceed to chose k2 > 0 so that the matrix J in (3.11) is Hurwitz. Our choice of k2 will be 
based on a root locus design. Since we do not have ἃ z component in this example the matrix J 
can be reduced by deleting the first column and the first row. Then we set k2 = k = 0 and define 
the resulting matrix to be Jo, 1.e., 


Ss - 0 0 F 


Then we define matrices 
BT=(0 0000 1), C=(0 00 0 0 1) 


and consider the system (J,,B,C) with feedback law wu = -k2y and transfer function 


( (51 18. 


Notice that 


ἢ 


J = Jo - 86, 
and, more importantly, the eigenvalues of J are the closed loop poles of this system. If k; > 0 our 
Proposition predicts that for large values of k2 the matrix J is Hurwitz. For k; = 10 we plot the 
locus of the roots of the characteristic polynomial of J, viewing kz as a gain parameter. For large 


kp, five of the 6 roots should approach five (zeros) on the imaginary axis, at 0, +27, +/0i, while 
the 6-th one goes to - 00. 


a= 4, a= VU, K= 3U, k= Y.Ud 


-3.5 -3 -25 -2 -15 -Ἵ -05 0 


Figure 3-1. Plot of Root Locus 
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In this example we take a simplified model of aircraft take-off and landing with a reference 


signal given by 


Ot. 
100, 
300 — 2t, 


Yref(t) = 


0 50 100 150 
time 


(left) plot of Yrer 


O<t< 50 
50 «ὁ < 100 
100 «ὁ < 150 


time 


(right) plot of x2, = x,+d 


Figure 3-2. y,.,;and x2, in Example 3.3 


Notice that γε, 1s not differentiable at t = 50 and t = 100 as it would actually be in practice. 
Indeed we have not actually used γι as given above but rather we have used a Hermite 
interpolation to round the corners at these points. Since it serves no particular good to produce 
these more complicated formulas we do not present them here. 


0 50 100 150 
time 


(left) plot of 6, = x," — γον 
Figure 3-3. 


15 


0 50 100 150 
time 


(right) plot of x, and Yrer 


@1, X1, ANd Yrer in Example 3.3 
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Example 3.4: In our second numerical example consider a system with a one dimensional stable 
zero dynamics. In this case we have set A =-1, C= 1, D, =1 and D,= 1. The relative values of 
B, and B; play a large role in the location of the closed loop poles so we present a “root locus 
plot” in two different cases: By = 1/4, By = 1/2 and B, = 1/2, Bz = 1/4. Also for this example we 
compute the reference zero dynamics trajectory Z,o¢ satisfying (3.4) for a given initial condition 
Zro(0) = 5. Once again we have set a) = 2, a2 = 10, and we have taken 


We(i i), wWe=(1 ἢ 


wig = 


so that 
d(t) = 25/2 sin(2t + 1/4) + 10-V2 sin(10t + 77/4). 
Next we chose G1, G>, k and kz so that the conditions of Proposition 3.1 are satisfied. 


In the numerical example we have set 4,= 20 and 


0.2 1 
G1= 4 os a 


which gives 


and 


UTS 'G, 4+ 0P97'Gp=4<1 UTST'G,=02, WFSy'G, =0.2 
so the conditions of Proposition 3.1 are fulfilled. 


Just as in the previous example we choose kz > 0 so that the matrix J in (3.11) is Hurwitz. In 
the present case more care must be taken in making our choice for k2 as is seen in the root locus 
plots in Figure 3-4. Just as in Example 3.1, if we first set kp = k = 0 in J and define the resulting 
matrix to be JO, 1.e., 


A 0-0 0 8B 
0S; - 0 0 F 
Jp . 4° 4 ’ 


ll 


OO 
> 
= 
> 
Θ 


| 


Then we define matrices 
Β΄ .--( 0600 000 1), €C=(000000 1) 
and consider the system (Jo, Β, C) with feedback law u = -k2y and transfer function 
66] — Jo) 1B. 
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Notice that 
J =o — ko BC, 


and, more importantly, the eigenvalues of J are the closed loop poles of this system. 


-3.5 3 75 κ᾿ ΤΕ 7 05 0 -3.5 3 35 2 75 A 05 0 
(Left) B = (A"B, + Bz) > 0, Κι = 15 (Right) = (ΑΒ. + Bz) < 0, Κι = 15 
Figure 3-4. Root Locus in Example 3.4 


Unlike the first example more care must be exercised in choosing k, and kz. As an example, 
if we choose k; > 0, wet A= -1, C = 1, then the sign of the real parts of the eigenvalues of Jo 
depend on whether B = (4 Δι + Β2) is greater than or less than zero. 


If k, > 0 our Proposition still predicts that for large values of kp the matrix J is Hurwitz. For 
k; = 15 and for B, = 1/4, By = 1/2, B= (A''B, + Bz) > 0 for which the some of the open loop poles 
are in the right halfplane while for k; = 15 and for B) = 1/2, By = 1/4, B=(4 'B\ +B) < 0 all open 
loop poles are in the left half plane for large k2, five of the 7 closed loop poles approach the five 
(zeros) on the imaginary axis, at 0, +2i, +10i, while the 6-th one goes to —co and the 7-th 
approaches the zero at —1. 


In this example we take a simplified model of aircraft take-off and landing with a reference 
signal given by 


yA I 0<t< 50 
Yret(t) = ¢ 100, 50<t< 100 . 
300 —2¢, 100 « ὁ « 150 
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0 50 100 450 
time time 


(Left) x2,=X2.+d (Right) e,(t) = x;(t) -Vree 


Figure 3-5. x2, and 6: in Example 3.4 


0 50 100 150 0 50 100 150 
time time 


(Left) x, and Vrer (Right) z 
Figure 3-6. X14, γιει, and z in Example 3.4 


3.3 Remarks on the Internal Model Principle 


One of the remarkable features of the Internal Model Principle is its ability to generate, in a 
systematic fashion, classical control designs. More importantly, it can also be used in 
multivariable, nonlinear and infinite-dimensional contents. 


As a first illustration of this point, consider a relative degree 1, minimum phase scalar-input 
scalar-output system 


g=Az+By 
y= Cxrt+ Dyt+ ku | 


where ynoisy = y + d with d(t) being an unknown constant. Setting 
ΘΟ Y — Vref 
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we obtain the relative degree one analog of (3.3)-(3.6). In this case, the constant disturbance 
takes the form 


d = vw 
where w is the state of the exogenous system 
w= 0. 
This generates the internal model 
é = —Ge 
v= pe 
which is a proportional-integral (PI) controller. Stability of the closed loop system 
b= Aert Be | 
xX = —Ge 


$= Ce yee 


is guaranteed, whenever WG > 0 by Lemma 3.2 (see [7]). 


As a second illustration, consider the system described in Example 3.1. The disturbance is a 
sum of two sinusoids having frequencies a, = 2 and a2 = 10. Figure 3-7 below depicts the Bode 
magnitude and phase plots of the closed-loop transfer function from the disturbance to the 
tracking error, obtained by using an internal model based design. The magnitude plot 15, of 
course, that of a typical notch filter having notches (or transmission zeroes) at precisely 2i and 
10i, reflecting the fact that the filter absorbs sinusoids at those frequencies. 


Bode Diagrams 


From: U(1) 


Phase (deg), Magnitude (dB) 


Frequency (rad/sec) 


Figure 3-7. Bode Plots of the System in Example 3.7 
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3.4 The Case of a MIMO System 


In the following we briefly sketch how the analysis developed in section 3 can be easily 
extended to deal also with multi inputs-multi outputs systems. We consider the class of 
minimum-phase linear systems with three inputs and three outputs with vector relative degree [2, 
2,2]. As is well-known, this class of systems can always be put, after a suitable change of 
coordinates, in the normal form 


2 = Azt+ Bir, + Boxe 

Ly = 9 

g = Cz+ Diya, + Doz + Kw 

δι. (3.14) 


where z€ IRr 
ΠΩ 8 Τ 8 
i= (21 2 Zig) ER f= ( 291 12) 125} ER’, 


με IR? is the vector of control inputs and z€ IR° of controlled outputs, with the matrix A which is 
Hurwitz and the high frequency gain matrix K€ IR? x IR° which is non singular. Our goal is to 
force the output vector y to asymptotically track a vector of known reference signals. 


-».-. 
Yref = ( τοῦ Yref,2 Yref,3 ) 


with the design of a proportional-derivative output feedback control law. Similarly to section 4 
the challenging aspect of this apparently simple tracking problem is given by the presence of 
additive noise on the velocity measures. More precisely we suppose, as above, that the measure 
of the position vector x, is noise-free, but the measured velocity vector is corrupted by additive 
harmonic noise, namely 


29 noisy(t) = g(t) +d(t) with d(t)=(d(t) Gf) &(t) )* 


where the signals di, i= 1,2,3, are given as superposition of sinusoidal functions of time. In 
particular the signals di(t), i= 1, 2, 3, are thought as 


where δ᾽ is a block-diagonal square matrix of dimension 2r; which describes a bank of r; 
oscillators at different frequencies. 


By mimicking the design methodology of the SISO case we define by Ζιορ any solution of 


Zref = Aref a Bry ret - Boy ret | 


and we choose the preliminary control law 


UU = | ae (—C Z ref re Diz = Day ret ar BY ia aR v) 


(3.15) 
where v is a vector of residual control inputs. Defining 
Εἰ = Y-Vree = 1 Vref 
€2 = ¥-Vret = 2 Vref 
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and z = Z — Zrer, Simple computations show that system (3.4) under the feedback (3.15), in the 
new error coordinates, reads as 


2 


= Ag+ Bye; + Boe 
δῚ = €9 
€5 = Cz4+ Doeg +v (3.16) 


The design of the vector of inputs v able to asymptotically stabilize the system (3.16), and 
hence to solve the tracking problem can be easily carried out by means of a three-steps procedure 
in which, at each step, just one control variable v; designed, following exactly the design 
methodology illustrated in the SISO case. 


To sketch how this procedure can be carried out consider, for the first control input v;, the 
following dynamic law 


é = ἐδ, -- Grkisen 
vy = —ky1 (£11 — Yret.i) — hoi (Cat noisy — Yref) + Ψ ξΙ 
= —hyye1, — και + Vids — ho Yiu (3.17) 


in which k;;, k>; and the entries of Ο᾽ are design parameters. Simple computations show that, 
having defined 


1 = &1 — hoy wi , 


system (3.16) with the partial control law (3.17) can be rewritten as 


ζ = AZ + Bie; + Boe 


Sia ςς 
δὰ = Cx+Do@.4% 3.18) 
where 
z= (2 χι eit eat " δι τε (42 es ἢ τε (υλ Us τι 
the matrix A is defined as 
A 0 i Bi 
Ἅ -- 0 88, —Gikiy 0 
~ | 0 0 0 1 


Ο' Ὁ τῶν  — (kar — Dy) (3.19) 


and B,,B,,D,,D,,B;,B,,C’,D, are suitably defined matrices 


Now it can be easily realized that it is possible to chose ki), k21, and G; so that the matrix A 
is Hurwitz. To this end just note that the matrix characterizing system (3.8) has the same 
structure of (3.19). Hence, by Proposition 3.1, we can claim the existence of a vector G; such 
that for suitably large k,,; and k22, the matrix A is Hurwitz. 


From this, once ΟἹ, k;; and k2 have been fixed, the procedure can be iterated in order to 
design v2 and v3. As a matter of act the system (3.18) exhibits the same structure of the original 
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error system (3.16), with the state variables Ἐ:, an Z which replace e;and Z , with the vector of 


control input v which replaces v, and with the Hurwitz matrix A which replaces A. This allows 
us to design also v2, and in the third step v3, using the same procedure used for the SISO case and 
come out with a proportional-derivative stabilizer of the form 


é = ¢-GKie 


vy = —Kye,;— Koeet VE (3.20) 


G, 0 0 v;, 0 0 
G= 0 ὦ 0 v= 0 WV. 0 . 
0 0 Gs 0 0 Fs 


Hence the overall control law is obtained from the composition of (3.15) and (3.20) 
Example 3.5: 


In our 3-dimensional numerical examples we have chosen ay = 2, a2 = 3, a3 = 4, 
Ul? — Pp? — yp? = (2 9): 


0 
20 
Wo = 0 
50 
so that 
5(sin(a,t) + cos(a;t) 
d(t) = | 10(sin(agt) + cos(agt)| . 
5(sin(a3t) + cos(azt) 


Next, we choose G), G2, G3 Κι and kp so that the conditions of Proposition 3.1 are satisfied. 
We have also set D; = D2 = 13x3. 


In the numerical example we have set k; = 10 and 


which gives 


UP SG, =1/5, WFSy'Go=2/8, WeSy'Gs = 1/10 


22 GP49152 


Robust Adaptive Final Report 
Control of UCAVs BOEING - STL 2004P0037 


and 


8 
UT SIG, an U2 55° Ge of UF S31Gs = τε «1 


so the conditions of Proposition 3.1 are fulfilled. 


From the theory developed in Section 3.1 we see that the closed loop system can be written 
as X= JX where 


X= ej ER”, 


€9 


and, just as in Example 3.3, since there is no z component, the matrix J can be reduced by 
deleting the first column and first row, to obtain 


S -ΟΚ, O63 
J= 


03x6 98x3 3x3 
Ψ -Κ, —(Ko—-D)) 


Since this is a MIMO example we cannot directly appeal to root locus methods but if we set 
Ko = kolgx3, and set k = (kg -- 1) 
then it is still simple to compute a “root locus type plot” of the closed loop poles as functions of 
the scalar parameter k > 0. 
Namely, we define the matrix to be Jo by 
S —-GK 1 Ogx3 
Jo= [0gx6 Osxg  Igx3} 
Vo —Ki 08x 


Then we define matrices 
BT =(0ix9 11 1), C=(O1xg 1 1 1) 
so that 
J=Jo—kBC 


and consider the eigenvalues of J as the closed loop poles of this system. 


For k; = 10 we plot the locus of the roots of the characteristic polynomial of J, viewing kp (or 
k) as a gain parameter. : 
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Figure 3-8. Plot of Root Locus 


In this example we track a three dimensional reference trajectory 


f 3nt πὶ ᾿ 
re το δ σοι : ref 
yo) = | 10 cos (=) 10 sin (5) Ye (| 


where 
f, 0<t<40 
ys'(t) = ¢ 40 + 5sin (ee) , 40<t < 100 
—(t — 150), 100 < ὁ < 150 


Figure 3-9. Xa1n =X 24 + G4, X22.n = X 22 + da, X23,n = X23 + dy 
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25 5D 400 150 ars XD 100 150 tp 50 100 150 
tire time time 


Figure 3-11. y™ x, and y™ 
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4.0 ROBUST ADAPTIVE NONLINEAR OUTPUT REGULATION 
- An Application To UCAV Flight Control - 


In this section, we discuss an application of Output Regulation methods to UCAV flight 
control. For this purpose, Neural Networks and Internal Model Principle Nonlinear Output 
Regulation (IMPNOR) methods are considered. These two methods are applied to the UCAV 
X-45A linear model independently, and the results are compared. We start with the plant model. 


4.1 Plant Model 


Beginning in this section, adaptive model reference augmentation of a baseline controller is 
introduced using the UCAV X-45A model, as an example. It should be noted, that the method is 
not restricted to this particular model, and that it is applicable to any plant dynamics that is taken 
to be of the form: 

i, =A, x, +BGA(5+K,(x,))=4, x, +B, A(5+K,(x,)] (4.1) 


—— 


B, 


In (4.1), using the UCAV X-45A flight dynamics nomenclature, 
T 
x,=(@ BP p qr) (4.2) 


is the ἡ, =5 — dimensional state vector that consists of angle of attack a@ , angle of sideslip , 


body roll rate p , body pitch rate g , and body yaw rate r. The vector of virtual controls δ᾽ is 


(mx1) — dimensional with m=3, G is the (Mxm) — control allocation matrix, A is the 


(mxm) — unknown constant diagonal matrix with positive diagonal elements, and 
T 
GAd= (S,08 Ome 1m One Onwe Orop δ᾽...) (4.3) 


is the Μ--7 -- dimensional vector of actual control inputs, (positions) that consists of 6 elevons 
and a thrust vectoring. Also in (4.1), A, isa (nxn) — constant known matrix, B isa (n, xM ) 


-- constant known matrix, B, is ἃ (n, xm) -- constant known matrix, and K, (x, ) is the (mx1) -- 


unknown state-dependent vector. Note that while A models unknown control failures, the 
vector-function K, (x a represents matched unknown nonlinear effects. 


Vertical acceleration a, (positive down) is computed as the system (4.1) output signal. 


a, =C, x, +DGA(5+K,(x,))=C, x, +D, A(d+Ko(x,)] (4.4) 
D, 


In addition, it assumed that the entire system state x, is available for control design 


purposes. 
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4.2 Baseline Inner-Loop Controller 
Dynamics of the UCAV X-45A baseline inner-loop controller is defined as follows: 
X,=A,x,+B,.x,+B,.u (4.5) 


lc""p 


where 
T 
x, =(q, Pr ἢ r,,) (4.6) 


is the ἡ, =4 — dimensional controller state vector that consists of pitch integrator q,, roll 
integrator p,, yaw integrator r,, and yaw rate washout filter signal r,.. The vector of inner-loop 
commands 


u= ia por ΠΩ pond "; (4.7) 


is comprised of n, = 4 commanded signals: vertical acceleration a’ , sideslip 8°", roll rate 


p°™ , and yaw rate γα, Combining (4.5) and (4.7) with (4.1), the corresponding augmented 
plant-controller system becomes: 


x, A, 0 Xp B, 0 
= + +K + 
δὼ δι 4 Ἢ | 0 τ 0(%)) B,, 
ean ae a ae Da engin Ca rei 

x A x 8, B, 

or equivalently 
χε Ax+B,A(5+K,(x,)]+B,u (4.8) 
Feedback/feedforward gains for the UCAV X-45A baseline (nominal) inner-loop controller 


are designed assuming no modeling uncertainties, A=J,,, K, (zx, | Ξε θ κι » and using the LOR 


method with projection [16, 17, 18, 19, 20]. The corresponding inner-loop control system takes 
the form: 


6,=K!'x+Kiu (4.9) 


where K, and K, denote the (nxm) and (n, xm) — nominal feedback and feedforward gain 
matrices, correspondingly. The number of virtual controls is m=3 and n=n, +n, represents 
the order of the extended system (4.8). 


Nominal inner-loop feedback (4.9), when applied to the ideal (1.e., without uncertainties) 
model (4.1), naturally yields the desired (reference) dynamics. 


Xrep = (A+B, KI?) Xp +(B, +B, ΚΙ )U= Ary Xpey + Brey Ὁ (4.10) 
Aref Bef 
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Assumption 1: Reference model matrix 4,,, is Hurwitz, (1.e., stable). 


The inner-loop control objective is to design state feedback control δ᾽ for (4.8) such that 
despite the system uncertainties A and K, (x, | , all closed-loop signals are bounded and the 


system state x tracks the reference model state x,,,. 


To this end, it should be noted that the reference model (4.10) was written using the baseline 
feedback gains designed for the nominal (i.e., no uncertainties) X-45A model. In order to 
achieve the inner-loop control objective for the system (4.8) in the presence of modeling 
uncertainties and unknown control failures, a direct adaptive model following control 
architecture will be designed. 


4.3 Inner-Loop Tracking Error Dynamics 
4.3.1 Internal Model-Based Controller 
Now define the tracking error to be regulated as 
€C=X—Xref 


and seek an observable (in the first approximation) controller with state § and output Κα 
of the form 


E = 7] (F5€,U ng ) 
ὃ =k (é,e) 


such that lim e(f)=0 and & remains bounded. From Eqs. (4.8) and (4.10) 
too 


é= A (A)e+ B,(A)x 


é = 1) (65€,U ma ) 
δ =k (é,e) 


where Ao(A)=A+B, AK’,, Bo(A)= B, (A-I) K'x, Bu(A)= B; (A-D) K',, and the control 
ὃ = κ(ξ, 6) is chosen such that the interconnected dynamics are Hurwitz in the first 
approximation. In addition, the control scheme must ensure that the nonlinear pitch break 
dynamics do not perturb the system away from some bounded set near e = 0 for failures under 
consideration. The choice of internal models 1s 

S, =F Aig: + BU ema a Nie 

6,=T.é+K,e+K,,sgn(e, )e,” j=1,23 

ὃ - ἰδ ι ὁ, ὃ. " 


+ BA[O+ Ky (6 Ἐχ,,.,.}Ὲ B, (AM ona 


ref 


where the nonlinear conditioning term is chosen so as to bound pitch-break dynamics based on 
the knowledge that the pitch-break phenomenon is quadratic in angle of attack. The gain-matrix 
entries on the nonlinear error feedback terms are chosen large enough to bound the pitch-break 
dynamics, but not so large that excessive control effort is produced. The form shown 


Κι, sgn(e, Je,” 
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gives a ‘cubic’ stability benefit with a ‘quadratic-in-the-error’ time-varying magnitude. The 
linear error-feedback gain-matrices Kj; and Nj are chosen to make Ap Hurwitz and to make the 
domain of attraction of the error large enough to bound coupled error-reference terms that are 
introduced by the quadratic pitch-break dynamics. The role of [;§ is to approximate the residual 
purely reference terms. This controller gives bounded error tracking for the closed inner-loop 
dynamics for the failure cases considered. The outer-loop problem was not considered, although 
the inner-loop control is sufficient to keep the outer-loop dynamics well-contained for inner-loop 
commands. 

The size of the neighborhood can be determined via Lyapunov analysis. Note that this 
controller yields approximation regulation because the pitch-break phenomenon introduces a 
term that is proportional to x’ o¢ that is not directly accounted for in the design of the IMPNOR 
scheme. However, this term does not adversely affect the steady-state solution, and, 
consequently, was not taken into account. Although, it is possible to realize a higher order 
IMPNOR scheme that contains a copy of the dynamics associated with this quadratic reference 
signal. Simulation results are provided in Section 4.6.2 for the Inner Loop and in Section 4.9.2 
for the Outer Loop. 


4.3.2 Adaptive Control Set-up 


Inner-loop tracking performance can be quantified by the corresponding tracking error 
signal: 


C=X—- KX, (4.11) 
Similar to [13, 14, 15], direct adaptive control input is formed: 


6=K' x+K'u-K,(x,) (4.12) 


In (4.12), K., Κ,, and Κ, (x, ] represent the adaptive feedback/feedforward matrix gains 
that will be determined through the Lyapunov stability analysis. 


Using (4.12), the corresponding closed-loop system dynamics becomes: 
x=Ax+B A(KT x+K]u-K,(x,)+Ko(x,))+B,u 
=(A+B, ART) x+(B,+B, ART \u+B, A(-K,(x,)+Ky(x,)) (4.13) 
ὄξος ον Ξε 


NETO 


A B 


Comparing system (4.13) with the reference model dynamics (4.10), the following 
assumptions are stated. 
Assumption 2: For a constant unknown diagonal matrix A anda constant control allocation 
matrix G , there exist "true" gains Α΄, K, such that the so-called uncertainty matching 


conditions take place: 


aur 
A, = A+B, Λ(ΚΙ) asa 


B 


oy = By +B A(Ki). 
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Remark 1: Knowledge of the true parameters Κ᾿, ΚΙ will not be required for adaptive control 
design. Only their existence 15 assumed. 


Inner-loop tracking error dynamics is obtained by subtracting (4.10) from (4.13). 


@é=Ax-A, Xf +(B-B,,,] u—B, A Κι (x, )-K, (<x, | 
ee ee 


aes (4.15) 
=A,-e+(A-A,, )x+(B-B,)u—B, A AK,(x,) 
Using uncertainty-matching conditions (4.14), the tracking error dynamics becomes: 
é=4,,e+B,A|(K,-Ki) x+(K,-Kj) u-AK,(x,) 
——— (4.16) 
AK, AK,, 
= A, e+ B, A(AK? x+AKy u—AK,(x,)} 


To this end, we approximate the uncertainty K, (x, } using multi-input-multi-output 


feedforward neural network (NN) with N, radial basis function (RBF) neurons in its inner layer, 
[22, 23]. The network computes m linear combinations of a suitably chosen set of radial basis 


functions {,(x,)}" , [24]. 
is 


Κι (χ,)Ξ a - Θ΄ ®(x,) (4.17) 


The RBF NN Universal Approximation Theorem [21] states that given an approximation 
tolerance εἰ >0, and a compact set X c Κ΄, there must exist an integer N, and a "true" 


constant matrix ΘΕ R*”” such that for all χ, ε X CR": 
K,(x,)=0" (x, )+&(x,) (4.18) 
and 


f6o(%) 


In other words, given enough neurons, (i.e., RBF functions), one can approximate a nonlinear 
function to within any accuracy. 


< ξ' (4.19) 
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Using neural approximation (4.18), we get 


“w 


ky (x,)= Kol s,)=Ko(x,) =(6-0) φ()- 60) 4.20) 


In (4.20), AQ =O-0, and E, (x, ] , represent unknown parameter estimation errors, and 


uncertainty approximation errors, correspondingly. Substituting (4.20) into (4.16) yields: 


ἐξ Ay e+ B, A(AK! x+ AK) u- AO" ®(x,)+&,(x,)] (4.21) 
or, equivalently 
xX 
é= Ay e+B,A\ (AK? AK] ΔΘ} u  |+e,(x,) (4.22) 
AQ’ -(x, ) 
ἘΦ τ 
Ψῴ, u) 


Hence, the inner-loop tracking dynamics can be written in the form: 


é= A, e+ B, A(AQ! ¥(x,u)+4,(x,)} (4.23) 
4.4 Inner-loop Stability and Adaptation 


Without a loss of generality, it is assumed that the uncertain matrix A has strictly positive 
diagonal elements. To this end, consider the following positive definite Lyapunov function 
candidate: 


V (e,AQ) =e" Pe+trace(AQ"T™ AQ A) (4.24) 


where trace of a matrix is defined as the sum of its diagonal elements. Given a symmetric 
positive definite matrix Q, let P denote the unique positive definite symmetric solution of the 


Lyapunov equation: 


Ay P+PA,,=-Q (4.25) 
Also note that in (4.24) Γ᾽ is a positive definite diagonal symmetric matrix. 
r. 0 0 
r=|0 YT, 0 (4.26) 
0 0 T, 
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Computing time derivative of the Lyapunov function candidate V along the system (4.23) 
trajectories, we get: 


V (e, AQ)=é" Pete’ Pé+2trace(AQ’T' QA) 
=(4,,e+B, A(AQ™Y(x,0)+& (x,))) Pe a 
+e" P( App e+ B, A(AQ! ¥ (x0) +& (x, )))+2erace(AQ’T QA) at 
-- οἵ Qe+2e" PBA AQ’ ¥ (x,u)+2trace(AQ’T!QA)+2e" PB, Ag, (x, ) 
Applying the trace identity x’ y= trace( yx") to the second term in the right hand side of 
(4.27), yields: | 
V (e,AQ)=-e" Oe+2e" PB, Aé,(x,} 
+2 trace( AQ” V(x,u)e’ PB, A) + 2 trace( AQ’ Γ΄ QA) (4.28) 
=~e" Qe+2e" PB Ae,(x,)+2trace(AQ’ (Y(x,u)e” PB, +T'Q)A) 
Based on (4.28), we choose adaptive laws: 
Q=T Proj(Q, -- (χ,) 67 PB, | (4.29) 
where Proj(-,-) denotes the projection operator, [25]. It ensures that the matrix of adaptive 


parameters Q does not exceed its pre-specified norm bound Q,,,, , and at the same time negative 


semi-definiteness of the Lyapunov function is maintained. Due to (4.29), an upper bound for the 
time derivative of the Lyapunov function can be found. 


V(e,AQ) « -εἶ Oet+2e' PB A&, (x, ) 
<A (Oe + elf? Bmax) 430 
=—[el(Anin (0)}6};-2} Βι| πιαχ (A) 


Consequently, the Lyapunov function derivative V (e,AQ) becomes semi-negative outside 


* 


é) 


of the compact set: 
lel < 2||P B, | max (A) & 
Z= λ΄. (Q) (4.31) 
OQ] ΞΩ μὰ 
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Thus, adaptation (4.29) yields bounded tracking. Moreover, due to the diagonal form of 1", 
the adaptive laws can explicitly be written as: 


ier Proj(K,, —xe "PB, 
car Proj(K,,-we" PB,} (4.32) 
O=T, Proj(®, ®(x, Je” PB,} 


4.5 Adaptive Augmentation of the Baseline Inner-Loop Controller 


Since the structures of the baseline inner-loop controller (4.9) and of the adaptive feedback 
(4.12) are similar, the former could be imbedded into the latter. 


6=6,(x,u)+K' χα ΚΤ u-6, (x,u)-K, (<x, | 


A T A T A 
= δι (x,u)+(K,-K, ] x+(K,-K,) u-K,(x,) (4.33) 
Ως St—SSS*~<CS~Ss«R Le) 


a ΓΤ rT.) AT 
= 6, (x,.x,.u)+k, x+k u-O ®(x,) 
Vee” 
nominal adaptive 


In other words, the adaptive system augments the baseline inner-loop controller. In this case, 
adaptation of the incremental feedback gains k,, k,,, © starts from zero initial conditions and 
has the following dynamics: 


= Proj(k,,-xe" PB,), ς. (0) Ξ 0 
k, =I, Proj(k,,-ue" PB), k,(0)=0 (4.34) 
6=T, Proj(O, &(x,)e" PB), O(0)=On p.m 


Note that in (4.34), the projection operator depends on two vector arguments and is defined 
column-wise: 


Τ 
Proj(6,y) = or?! (0), if {f(@)>Oand γῖ Vf(8) > 0} τ 
y, 1f not 
The corresponding convex cost function and its gradient are given below: 
)=[pf 22 
eg (4.36) 
Vf(¥)= iyo 


» 


33 GP49152 


Robust Adaptive Final Report 
. Control of UCAVs BOEING - STL 2004P0037 


In (4.36) y,,,, denotes the norm upper bound imposed on the parameter vector y, while €, 


defines the projection tolerance. Detailed description of the projection operator is given in 
Appendix A. 


Designed to yield bounded tracking, the adaptation process (4.34) has the ability to not only 
change/reconfigure the nominal feedback gains K, and K, , but also to identify and counteract 


the nonlinear matched uncertainty K, (x5) in the system (4.1). As a result, bounded tracking of 


the reference model trajectories is achieved in spite of unknown control failures and unmodeled 
nonlinear dynamical effects. The corresponding closed-inner-loop system block-diagram 15 
shown Figure 4-1. 


a Xref ΤᾺ X= (a BP ray Pyrat)” 


model 
e 


¢ Adaptive 


Ζ 
[ὁ 
Control 
(+) Allocation - Actuators Ac b> Sensors Χ 


Baseline inner-loop 
controller 
U= (acm Remd pom rome) GP44275017.ppt 


Figure 4-1. Cilosed-Inner-Loop System Block Diagram 
To this end, it should be pointed out that the adaptive system (4.34) was derived using RBF 
NN approximation (4.18). An alternative approach that uses Single-Hidden-Layer (SHL) 
feedforward neural networks with sigmoidal activation units instead of the radial basis functions 


is presented in Appendix B. The two neural networks, RBF NN and SHL NN, are 
complimentary in their approximation abilities [21]. 


4.6 UCAV X-45A Inner-Loop Design and Evaluation 
4.61 Neural Networks-Based Controller 


The adaptive augmentation approach illustrated in Figure 4-1 was utilized for the design of 
the UCAV X-45A adaptive reconfigurable inner-loop controller. For the design purposes, open- 
loop X-45A aircraft data (i.e., plant) were taken to represent low speed dynamics at Mach = .3, 
Altitude = 5K ft, and angle of attack AOA = 6 degrees. 


Baseline inner-loop feedback / feedforward gains in (4.9) were designed using the LOR 
method with output projection, [17, 18, 19, 20, 25]. 


Using relation (4.10), the reference model matrices 4,,, and B,,, were computed. It was 


verified that the reference model matrix 4,,, was Hurwitz, (1.e., Assumption | was satisfied). 


While the 1" and the 3 components of the modeling uncertainty K, (x, } were set to zero, 


its 2" component was chosen to represent the so-called “Pitch Break phenomenon” that 
depended solely on angle of attack, (AOA). Figure 4-2 shows the corresponding data. 
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Figure 4-2. Pitch Break vs. AOA data 


Radial basis functions were chosen in the form of Gaussians: 


7 2 ey ree 
aA-a,) +B+p +q + 
a Bi th Sol 


AOA break points [α,} 19, Were spaced evenly between —5 and 5 degrees, and half a 
J=l,..5 


degree apart from each other. The Gaussian widths were set to σ᾽ =0.5. The latter provided for 
a reasonable overlap between the individual basis functions. Also, a unity basis function 


Dy, (x, ) =1 was added to the RBF NN in (4.17), (N, =22). 
The adaptation rates in (4.32) were chosen to be: 
r,=100diag((0 0 1 1 1 0 0 0 OJ) 
r, =0 
Γρ =100eye(N, ) 
where “diag” and “eye” denoted diagonal and identity matrices, correspondingly. Note, that in 
general the Lyapunov function in (4.24) requires nonsingular matrices of adaptation rates. 
However, since the inner-loop dynamics primarily depends on body angular rates, the rates of 


adaptation chosen above are adequate, and the rest of the system dynamics can be treated as a 
modeling uncertainty that is approximated and cancelled on-line by the RBF NN. 


Positive semi-definite solution of the Lyapunov equation (4.25) was obtained by choosing 


Q=diag([(0 0 25 2.5 25 0 0 0 Oj} 
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In order to constrain the inner-loop adaptation parameter values in (4.34), column-wise norm 
upper bounds y,,. in (4.36) were set to unity, and projection tolerances €, were chosen to be 


10% of the norm upper bounds. 


Closed-inner-loop system performance was evaluated in the UCAV X-45A flight simulation 
environment that included 1° order actuator models with a time constant as well as CrowMix -- 
the X-45A nonlinear control allocation logic. Various control failure scenarios with and without 
the pitch break phenomena were tested. Simulated performance with adaptation was compared 
to the performance provided by the baseline inner-loop system without the adaptation. The 
simulation evaluation results are shown in Figures 4-3, 4-4, and 4-5. For comparison purposes, 
simulation data were obtained from the following three closed-inner-loop systems: a) Adaptation 
OFF / Failures OFF (blue), b) Adaptation OFF / Failures ON (red), and c) Adaptation ON / 
Failures ON (black). 


Figures 4-3a, 4-3b, and 4-3c demonstrate adaptation benefits when the right outboard (ROB) 
elevon fails at 1 second into the maneuver. Figure 4-3a indicates that in spite of the unknown 
control failure, the adaptive system is able to quickly reconfigure and track the commanded 
vertical acceleration, sideslip angle, roll rate, and yaw rate signals, simultaneously. In fact, 
Figure 4-3a shows that with the adaptation turned on the desired/nominal system tracking 
behavior has been almost recovered. In addition, Figure 4-3b compares the three virtual control 
feedback signals, and Figure 4-3c confirms that the level of control activity is reasonable and that 
no control saturation has occurred during the adaptation process. 
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Figure 4-3a. Inner-Loop Adaptation with ROB Elevon Failure: Command Tracking 
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Figure 4-3b. Inner-Loop Adaptation with ROB Elevon Failure: Virtual Controls 


Time, sec 


Figure 4-3c. Inner-Loop Adaptation with ROB Elevon Failure: Controls 
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Figures 4-4a, 4-4b, and 4-4c demonstrate adaptation benefits when in addition to the ROB 
elevon failure the Pitch Break phenomenon is active throughout the entire maneuver. As seen 
from the data, the adaptive process removes the unwanted oscillations and quickly “learns” to 
cancel the unknown Pitch Break nonlinear phenomena. As expected, all of the control signals 


remain well within their position and rate limits. 
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Figure 4-4a. Inner-Loop Adaptation with ROB Elevon Failure and 
Pitch Break Phenomenon: Command Tracking 
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Figure 4-4b. Inner-Loop Adaptation with ROB Elevon Failure and 
Pitch Break Phenomenon: Virtual Controls 
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Figure 4-4c. Inner-Loop Adaptation with ROB Elevon Failure and 
Pitch Break Phenomenon: Controls 
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Figures 4-5a, 4-5b, and 4-5c demonstrate adaptation benefits when the inner-loop system 
receives repetitive vertical acceleration commands in the presence of both the ROB elevon 
failure and the unknown Pitch Break phenomenon. The data in Figure 4-5a show that while the 
nominal system provides consistent (oscillatory) performance, the adaptive process quickly 
dampens the oscillations caused by the uncertainties and significantly reduces the corresponding 


control activity. 
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Figure 4-5a. Inner-Loop Adaptation with Vertical Acceleration Commands, ROB Elevon Failure, 
and Pitch Break Phenomenon: Command Tracking 
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Figure 4-5b. Inner-Loop Adaptation with Vertical Acceleration Commands, ROB Elevon Failure, 
and Pitch Break Phenomenon: Virtual Controls 
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Figure 4-5c. Inner-Loop Adaptation with Vertical Acceleration Commands, ROB Elevon Failure, 
| and Pitch Break Phenomenon: Controls 
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Adaptive inner-loop system performance was also tested in the UCAV X-45A high fidelity 
nonlinear 6-DoF flight simulation environment, (MATRIXx). Figure 4-6 compares performance 
of the three systems: a) Baseline/No Failures (black), b) Baseline/Failures ON (red), and c) 
Adaptation ON/Failures ON. As seen from the figure, the adaptive system 15 able to recover its 
baseline/no failure performance, while accurately tracking the desired inner-loop commands. 
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Figure 4-6. Inner-Loop Adaptation with RIB/ROB Elevon Failures @ Trim 


The adaptive system performance was evaluated using an extensive matrix of test cases. 
Linear data for the reference model (4.10) were scheduled throughout the aircraft flight 
envelope. Required by the adaptive laws (4.34), the unique symmetric positive definite solution 
P of the corresponding algebraic Lyapunov equation (4.25) was approximated on-line using the 
iterative procedure outlined in Appendix C. 
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4.6.2 Internal Model Principle-Based Controller 


An IMPNOR control scheme was tested for various inputs and failure conditions. Figure 
4-7a shows the simulation results of the inner-loop system with the IMPNOR controller when the 
right outboard surface (ROB) failure is present. As can be seen, the IMPNOR controller 
sufficiently recovers the nominal response of the aircraft, by reducing the oscillation. In Figure 
4-7a, 4-7b, and 4-7c, the legend IMP is used for IMPNOR. 


Figure 4-7a. Aircraft Response without Pitch Break Phenomenon with IMPNOR Control 
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For the body axis angular accelerations, P,Q,R the IMPNOR controller adequately reduces 
the oscillations that they experience with a ROB failure, as can be seen in Figure 4-7b below. 
Although the angular accelerations P and Ὁ still exceeded that of nominal, the aircraft motion 
is under greater control than that without the IMPNOR controller. 
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Figure 4-7b. Aircraft Acceleration Response without Pitch Break Phenomenon 
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Figure 4-7c shows the deflections of the individual surfaces and that the aircraft’s surfaces 
are within actuation position limits and do not suffer from the same oscillations that occur 
without the IMPNOR controller. 
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Figure 4-7c. Surface Deflections in Flight Conditions without Pitch Break 
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Next, the same conditions and inputs were applied to the same aircraft model, only this time 
with the Pitch Break phenomenon active. From Figures 4-8a, 4-8b, and 4-8c, we see that the 
IMPNOR controller is able to recover the nominal aircraft response when the right outboard 
surface failure is present. In fact, the controller is able to dampen out the additional errors 
induced by the Pitch Break phenomenon in A, and B better than the baseline controller, giving 


much better damped responses to all inputs. 
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Figure 4-8a. Aircraft Response with Pitch Break Phenomenon 
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Figure 4-8b. Aircraft Acceleration Response with Pitch Break Phenomenon 
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Figure 4-8c. Surface Deflections in Flight Conditions with Pitch Break 
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4.7 UCAV X-45A Guidance (Outer-Loop) System Augmentation 


In this section, the X-45A guidance system is introduced into the adaptive design process. 
Two guidance models, 1,,,, =3 — dimensional longitudinal and n,,,, = 2 — dimensional lateral- 


directional, are introduced: 


Xiong = Along Xiong +B long Yong ( 4 3 7) 
-΄ x, ἘΠ, πὶ τ ρα 
y long long “long long “long ~— 
i = Ana Xara + Brad Ἡ μια (4.38) 
es _ acmd ᾿ 
Vira = Crata Xara + Phara Utara = ᾧ 


The three dimensional input vector u,,,. to the longitudinal guidance model (4.37) is defined 


as: 
Vong =(h°™ hh). (4.39) 


where f°", h andh denote commanded altitude (ft), actual altitude (ft), and aircraft vertical 


speed (ft/sec). Commanded altitude acceleration he" (ft/sec/sec) is computed as the system 
(4.37) output signal, (that is y,,,. = hen), 


The two dimensional input vector u,,, to the lateral-directional guidance model (4.38) is 
defined as: 


Mina = ("wy (4.40) 


where yw“ and yw denote commanded and actual heading (yaw) angles in degrees. 
Commanded roll angle ¢°”” (deg) is computed as the system (4.38) output signal, (that is 


cmd 


Vg = 9°" ). Based on the latter, the commanded roll rate ρ΄ (deg/sec) formed: 


pr =(9°" —O)k, (4.41) 


In (4.41), positive constant k, represents a commanded roll gain (1/deg). In addition, the 


cmd 


commanded yaw rate r°”” (rad / sec) signal is specified as: 


sin 
yond = BSP (4.42) 
V 
where V denotes the aircraft true airspeed. 
Using the two guidance outputs h°" and 6°”, vertical acceleration command a‘ is 


computed: 


a - (4.43) 
COS (gor 
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Combining (4.43), (4.41) and (4.42), the vector of inner-loop commands is formed: 


T 


pm (gm —g)k, 855 (4.44) 


ὩΣ 
“| cos (o"" 


In order to properly account for the 4 extra degrees of freedom (h, ἢ, φ, y’| used in the 
guidance model, their approximated dynamics 15 written as: 
h = —a, COS@ 
p= p (4.45) 
Ww=rcos¢+qsing 


Let Bone Sar plone by” | and B,, = ἕω bet | . Augmentation of the (Plant + 


Inner-Loop Controller) — system (4.8) with: a) extra 4 degrees of freedom (4.45), b) longitudinal 
guidance (4.37), and c) lateral-directional guidance (4.38), results in the augmented dynamics: 


χε Ax+B,A(6+K,(x,))+B,u 


h=- (C, 0)x+D, A(5+K,(x,)) cos @ 
ke 


ς 


(4.46) 
ῴφ-ρ 
w=rcos¢+qsing 
Χ = Aone Xiong + bee h a τ td h 23 ie h 


. ae latd md latd 
Xtata = μια Xara * ὃ, yen + BW 


a 
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Assuming “small angles” and rewriting (4.46) in a matrix form, we get: 


x A 0 060 0 0 0 90 x 
h 0 0 1 0 0 0 90 h 
h Cc 0 0 00 0 0 h 
dl=le 0 0 0 0 0 0 d 
Ws e 0 0 0 0 0 0 yy 
Ms 0 be bee 0 0 A, 0 ie 
ius J 0 Od 0 : Dee 108 Ae Aes 
x Ay τ 
B B, 0 0 0 
0 0 0 0 0 
-D, 0 0 0 0 ifr 
+| 0 A(5+K,(x,))+ Olwt+} 0 O 0 | B™ 
0 0 0 0 0 tw" 
0 0 pee 0 01 
0 0 0 0 4” 
ἘΞ τὶ Se ee 
or, equivalently: 
X= A,X+B,A(5+K,(x,))+B,u+ Bu (4.47) 


In the formulation of the augmented system matrix A, in (4.47), two unit vectors (e,, ες) 


were introduced: 


e,=(0 0100000 0) 
: (4.48) 
e=(0 00010 0 0 0) 
Additionally, matrix element Av was set to a nonzero value that represented the aircraft 


bank angle (g) influence on the sideslip (f) dynamics. 


Let Dong = | aie die iene ie and ἢ, = ἔα dy |. Then using again the “small 


angles” assumption, the vector uw of inner-loop commands (4.44) becomes: 


u=( “fm pr (g°" -O)k, Ὥ (4.49) 
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In (4.49), guidance output signals he and g°”" are defined by (4.37) and (4.38), 
correspondingly. 


h cmd _ on ἜΝ Ἴ: ae heme te 7 h+ απ h 
cmd __ latd cmd latd (4.50) 
Or" = Crna Xaatd ψ τῷ W 
Using relations (4.50) yields: 
ΞΟ Xiong τ ae ae . a h = a h 
» 
μΞ (Ομ Maa απ 0 +s” wo) Kk 
& 
Ξ ῴ 
or, equivalently, in a matrix form: 
x 
0 -αῷ —gere 0 0 Cyn Ὁ h 
0 0 0 0 0 0 0 h 
u=|0 0 0 -k, πὶ 0 Cyak, || ? 
0 0 0 5 9 0 0 ᾿ 
V Xiong 
Fy Natd (4.51) 
—d,°"? 0 0 Rom 
0 I ᾿ pr =F'X+F a 
0 0 ὦ," k, ΓΒ. 
0 0 0 en Le 
er, en iu 
RT 


Finally, substituting (4.51) into (4.47) yields the complete system dynamics: 
x= (4, +B, FT Χ 18, A(d+ K, (x, ))+(B, + B, Fy )a 
Nt, ,..--’ $$ 
A B, (4.52) 


Note that in (4.52) a three dimensional vector of the outer-loop commands 
u = (ee pm an : was introduced. From (4.51) it follows that the inner-loop commands 


u are determined through the outer-loop commands w and the system state x using the 
feedforward gain matrix F, and the feedback gain matrix F. , correspondingly. 
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where the symmetric positive definite matrix P satisfies algebraic Riccati equation: 


Al, P+PA,,-PB,, RB, P+O=0 (4.57) 


Using (4.55) and (4.56), the corresponding closed-loop reference model dynamics become 
asymptotically stable and can be written as: 


y= (Ay + Boy FE) rep + (Boop Εν = AG Xe + BG (4.58) 
Aa Bo, 


At the same time, feedforward gain F, can be chosen to achieve a desired DC gain matrix 
σι, of the reference model (4.58). 


-( 4%, ΒΕ ΞΟῚ (4.59) 


The unique optimal solution of (4.59) 15: 


ref ~ ref 


Sis 


FY - -ἰ δ 5 i ST, Guy = Si. Grey (4.60) 
Sich Sais 


Ι -- 
where Si. is the pseudo-inverse matrix for the rectangular matrix S~= =(A2,) Bey. 


Note, that the newly derived asymptotically stable reference model (4.58) is similar to the 
original reference model (4.54). Consequently, without a loss of generality and throughout the 
paper it is assumed that matrix 4. is Hurwitz to begin with, and that the reference model is of 


the form of (4.54). With that in mind, the outer-inner-loop contro] objective can now be 
formulated. 


The outer-inner-loop control objective is to design state feedback inner-loop control 6 for 
the complete system (4.52) such that despite the system uncertainties A and K, (x, ), all closed- 
loop signals in (4.52) are bounded, and the system state x tracks the state x,,, of the reference 
model (4.54). 


Since the system (4.52) and the reference model dynamics (4.54) are similar to the system 
(4.8) and its corresponding reference dynamics (4.10), the adaptive control design process is also 
similar to the design that was introduced in Sections 4.3, 4.4, and 4.5. Repeating the design steps 
that lead to the inner-loop adaptation (4.34), the new adaptive laws can be written that explicitly 
account for the guidance loops: 


ke =T, Proj( 


(4.61) 
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In (4.61), @=x—X,,, represents the complete system tracking error, P is the unique 
symmetric positive definite solution of the algebraic Lyapunov equation: 
Avy P+PA,,=-Q (4.62) 
and QO is asymmetric positive definite matrix. In addition, symmetric positive definite matrices 


ΓΙ, l,, 4 must be chosen to specify desired rates of parameter adaptation. 


As it was done in (4.33), the adaptive incremental control is embedded into the nominal 
inner-loop linear feedback. The total inner-loop control takes the form: 


6=6,(x,,x,,u)tk x+k 7-0" (x, ] (4.63) 
nominal adaptive 


It is designed to a) change / reconfigure the nominal feedback gains K,, Κὶ,, Ὁ) feed back / 
feed forward the guidance states / commands, and c) identify / counteract the nonlinear matched 
uncertainty K, (χ,) in the original system (4.1). As ἃ result, bounded tracking of the reference 
model (4.54) trajectories is achieved in spite of unknown control failures and unmodelled 


nonlinear dynamical effects. The corresponding closed-inner-outer-loop system block-diagram 
is shown below. 
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Figure 4-9. Closed-Inner-Outer-Loop System Block Diagram 


Note that if the reference model is defined by (4.58) instead of (4.54) then P should be taken 
as the unique solution of the algebraic Riccati equation (4.57). Also, the total inner-loop control 
input takes the form: 


ὃ- ὃ, (χ,.χ,»") ἐκ χε [Εἰ ΟΝ ν) -67 Φ(χ,) (4.64) 
nominal guidance augmentation 


adaptive 


with the inner-loop commands vector u defined as in (4.51). 
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με ΕΓ ΧΕΙ  a=FIx+F] (ΕΓ χα Εἰ ν) 


ΞΙ ΕΊΣ: ΕΓ ΧΕ Εἶν- ΕἼ χει Εν (4.65) 


The corresponding closed-loop system block-diagram 15 shown in Figure 4-10. 
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Figure 4-10. Closed-Inner-Outer-Loop System Block Diagram 
with Guidance Augmentation 
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Figure 4-11a. Inner-Outer-Loop Adaptation during Max Rate Climb/Level Off 
with ROB Elevon Failure and Pitch Break Phenomena 
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Figure 4-11b. Inner-Outer-Loop Adaptation during Max Rate Climb/Level Off 
with ROB Elevon Failure and Pitch Break Phenomena 
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Figure 4-11c. Inner-Outer-Loop Adaptation during Max Rate Climb/Level Off 
with ROB Elevon Failure and Pitch Break Phenomena: Control Deflections 


4.9.2 Internal Model Principle-Based Controller 


To evaluate the performance of the IMPNOR controller with the Outer Loop command for a 
climb and level-off maneuver, simulations were performed and the results are shown in Figure 
4-12a, 4-12b, and 4-12c. Note that the Pitch Break phenomenon is active. As can be seen from 
Figure 4-12a, 4-12b, and 4-12c, unwanted oscillations caused by either the surface failure and/or 
the Pitch Break phenomenon are effectively cancelled while keeping the surfaces within their 
actuator position limits. 
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Figure 4-12a. Aircraft Response to Climb/Level-off with 
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. Aircraft Angular Acceleration Response to Climb/Level-off with 
Pitch Break Phenomenon and ROB Failure 
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Figure 4-12c. Aircraft Guidance Response to Climb/Level-off with 
Pitch Break Phenomenon and ROB Failure 


4.10 Conclusions 


Design, stability analysis and simulation evaluation of a direct adaptive model following 
control system was presented. The adaptive laws were theoretically justified and derived based 
on the fundamentals of the Lyapunov Stability Theory. The system was designed to augment an 
existing nominal robust baseline controller with an adaptive online trained feedforward neural 
network. The adaptive element provided for reconfiguration in the presence of system 
uncertainties (e.g., battle damage), control failures, and environmental disturbances. Using 
UCAV X-45A model and data, a step-by-step adaptive control design process was presented. 
Radial basis functions (RBF) and single-hidden-layer (SHL) feedforward neural network (NN) 
architectures in conjunction with the projection operator were employed to derive the adaptive 
laws. The adaptive element provided bounded tracking and a uniform ultimate boundedness of 
all the signals in the corresponding closed-loop system. Both the inner-loop design and the 
outer-inner-loop (explicitly accounting for guidance) adaptations were performed. Benefits of 
the adaptive system and its performance were evaluated in the UCAV X-45A flight simulation 
environment. 
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5.0 ADAPTIVE CONTROL OF ELECTROMAGNETIC ACTUATORS 


Actuator nonlinearities are among the key factors limiting both static and dynamic 
performance of feedback control systems. Harmful effects of backlash in gears are well known. 
In fact, such nonlinearities were observed and reported in a technical memo [27]. In this memo, 
it was reported that the quantizer and backlash effects dominated the nonlinear response, and that 
the response became very nonlinear as the command magnitude (fin deflection angle of a 
missile) decreased. 


Backlash prevents accurate positioning and may lead to chattering and limit-cycle-type 
instabilities. This increases wear and tear on the gears, which in turn, increases backlash. 
Various anti-backlash gears have been proposed as a solution to this problem. However, their 
cost is high, and they introduce extra weight and friction. Attempts to eliminate or reduce 
backlash by increasing the gain of the feedback loop lead to sustained oscillations which may 
cause rapid wear of gear trains, valves, and other components. Thus, intelligent but inexpensive 
approaches to increase accuracy against the nonlinearities in the actuator dynamics are needed in 
order to improve aircraft and missile performance without additional cost. 


G. Tao and P. Kokotovic have done extensive investigation on adaptive control of systems 
with actuator and sensor nonlinearities [28]. We will follow their methodology (we will call it 
“Adaptive Inverse Method” in this report) to implement an effective nonlinear control scheme 
for actuators. We will also apply the Internal Model Principle technique to compare the results 
with those by the Adaptive Inverse method. 


5.1 Inverse Models 


The method proposed by Tao and Kokotovic uses an inverse model to cancel the nonlinear 
effects induced by dead-zone and backlash. That is, if we let u = N(v) represent the nonlinear 
element in the actuator, where v denotes the input to the actuator and u the output from the 
actuator, then the compensator for the input nonlinearity N(.) is v= NI(ug). Here, NI(.) is the 
desired inverse and ug is the control input which would achieve the control objective in the 
absence of N(.). Thus, when the inverse is exact, u = N(NI(ug)) = ug achieves the control 
objectives as if N(.) were absent. However, in reality, NI(.) is seldom exactly known and only an 
estimate NI..(.) of the inverse NI(.) can be implemented. 


5.1.1 Dead-Zone Inverse 


Let b, (> 0) and b; (< 0) denote break-points, and m, and m,, represent the slopes of the linear 
portion of the dead-zone characteristics as shown 1 in Figure 5-1. For a linear parametrization of a 


dead-zone inverse, we use the estimates m b m,, M, ἢ , m, οἵ μι, δ. m,, m,b,, m,, 


ror? 


respectively. The estimates of b,, b, are then obtained from 


je (5.1) 
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Then the inverse model for the dead-zone characteristic is described [24] by 


A 


Ξε ἐιξῦ 
mM, 
26 if ug(t) = 0 
ar ANS Baas if ug(t) «0 (5.2) 
mM, 


M,: slope 


M,: slope 


GP44275043.ppt 


Figure 5-1. Dead-zone 


5.1.2 Backlash Inverse 


One of the damaging effects of backlash on system performance is the delay corresponding 
to the time needed to traverse an inner segment. Another undesirable effect of backlash is the 
information loss occurring on an inner segment when the output u(t) remains constant while the 
input v(t) continues to change as shown in Figure 5-2. 


GP44275044 ppt 


Figure 5-2. Backlash Response to a Sawtooth Input 
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These undesirable effects are eliminated with the backlash inverse v(t) = BI(ug(t)) where v(t) 
is determined by 


(t) 


MSG aan ae: for 
m ἢ 
if a(t) <0, v(t) = UMD) ἊΣ 
{4 


= 0 if u(t) =0 


= ρί Ὁ if u(t) > 0, v(t) = u,(t) 
m 


ΤΟ 


Sop Shade te (5.3) 
m 


r? 


In this definition, the inverse of a horizontal segment of the backlash characteristic 15 a 
vertical jump defined as the time integral of the impulse function 
g(T,t) = O(t-t)(c,-c)) (5.4) 


where δ( is the Dirac 6-function. Thus an upward jump in the backlash inverse 1s 
+ re u,(t ) 
v(t’) = v(t) + [e(e.odr=-— +c, (5.5) 
: m 


The effect of this jump in the backlash inverse model will be to eliminate the delay caused by 
a segment in the backlash characteristic. In a similar manner, the use of the backlash inverse 
model given by (5.3-5.5) restores the information that would have been lost without it. When the 


A 


backlash parameters m, ΟἹ, ον are unknown, we will replace m, Οἱ, and c, with their estimates m, 


A A 


c,, Ὁ, in equations (5.3-5.5) to design a backlash inverse estimate. 


5.2 Adaptive Inverse Control 
The objective is to control a system with a nonlinearity N(.) at the input of a linear time- 
invariant part G(s), which is described by 
y(t) = G(t) [u](t), u(t) = N(v(t)), (5.6) 
Z(s) 
Ρ()᾽ 


where v(t) is the control input and y(t) is the measured output. The linear part is G(s) = Κα 


where k is a constant gain and Z(s), P(s) are monic polynomials in the Laplace transform 
variable s. Our goal is to develop a method to design a feedback control v(t) using the measured 
output y(t) to achieve global stabilization and close tracking of a reference signal y,,(t) by y(t). 
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The control strategy proposed by Tao and Kokotovic uses a linear model reference controller 
structure plus an adaptive nonlinear inverse model to be placed at the input of the nonlinear 
element. To employ a model reference approach, they characterize the reference signal y(t) as 


ym(t) = Wm [r] (0 | (5.7) 


where Ν᾽ is stable and the command input r(t) is bounded and piecewise differentiable and 
impose the following assumption: 


Al) Z(s) is a stable polynomial, 

A2) The degree n of P(s) 15 known, 

A3) The relative degree n =n-—mof G(s) is known, 

A4) The sign of the gain k is known, 

A5) Wm(s) = Pm _(s) for a stable polynomial P,,(s) of degree equal to the relative degree n*. 
With these assumptions together with the assumption that excludes the inappropriate range 


for parameter, they established the controller structure shown in Figure 5-3. 


If (0) and r(t) are both available and, r(r) is bounded, then the following controller 
structure is suitable: 


u,(t)=0, (ἢ +8, O,(t)+ A y(t) +, 10) (5.8) 


* * * * 
where @, and @, are vector parameters; and @,, , 8, are scalar parameters. 


GP44275045.ppt 


Figure 5-3. Controller Structure for G(s) Known 
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In the filters 


(s) 
0, (1) =~ [u, |), 
os (5.9) 
_ als 
QO, (Zt) = A(s) [ν](ἢ), 


the vector a(s) 1s 
a(s) = (1,8, ....,8")" 
and A(s) 15 a stable polynomial of degree n-1I. 
The parameters 
θ᾽ ε R"',0, € R"',0,, €R,O, ER 
are chosen to satisfy the matching equation 
θ᾽ a(s)P(s) +(0," a(s) + Oy, A(s))kZ(s) = A(s)(P(s) -- 6, kZ(s)P,(s)) (5.10) 
With this controller structure a dead-zone inverse is directly implementable, while the 
backlash inverse requires ua(t), which can be obtained by 


' sa μ 1 - 9.7 sa(s) 


᾿ θ 
“(Ὁ τ A(s) ΛΟ) 


[}](Ὁ + A) y(t) +0, r(2). (5.11) 


An alternative controller structure is also developed for the case when y(f) 15 not available, 
but r(t) is measured and bounded (see [24]). If κι or r(t) is not available, then we can 


employ ἃ simple approximation of uwa(t) such as 


᾿ RY 
μα() = 
TS + 


[u, ](t) (5.12) 
where T > 0 is a small constant and y(t) and r(t) as 


y(t) = plo 
on (5.13) 


r(t) = ——[r](0). 
tT +1 


Based on the tracking error equation derived by Tao and Kokotovic, 


e(t)=W(s)[(Oy -- Oy)’ @y Ἰ(ἢ +d) (5.14) 
where d(t) = W(s)[dy/](t), the following update law can be adopted: 


Oy(t) = — “(ὃ (5.15) 
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where fy(t) is a modification term, 
Ey(t) = e(t) τ ἔν (ἢ 
Ey (t) = Oy (t)6y(Q-W(s)LBy ὦ, MD) 
Cy (t) Ξ (soy IO 
my (= Υ1 Ὁ ςν΄ (ἢς, (ἢ τῷ, (ὦ 
DT, = diagty,,..+5 Vii 17, >O,i=1,...,my 


“A 


6,,=8, =(m,,m 


(5.16) 


A “A A A 


b.,m,,m,b,)' or@, =(mc,,m,mc,)° 


pore 
and ny is the dimension of θ᾽. 


To Evaluate the performance of the Adaptive Inverse Control scheme, simulation was 
performed using the Miniaturized Munition Technology actuator model shown in Figure 5-4. 
The model contains nonlinearities such as deadzone, Coulomb friction in the motor, Coulomb 
friction in the load, and Quantization errors. The simulation results showed a potential that the 
Adaptive Inverse Control technique can provide an improved performance in controlling the 
actuator dynamics compared to a controller with a detuned fixed inverse or no inverse. 
However, the simulation results also showed the following issues that need to be addressed for 
practical implementation of the method: 


Quantizer 


OF | Radian 
ΠΝ Quantizert a) TLfric_todeg \K/ 


ἊΝ € 
KF To 
KG Workspace 
100] delta 
T 


Signal 


Ο Scope 
Generator Workspace1 
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Figure 5-4. Matlab MMT Model 
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1. Control chatter — “softening” the inverse could reduce the chatter at the cost of increased 
inversion errors. 


2. Adaptation gain —lack of this logic made the adaptation process extremely slow. 


3. Robust adaptation — need of robust adaptation logic that would allow larger initial 
parameter errors at the expense of increased tracking error bounds. 


We consider an alternative method that alleviates these potential problems. 
5.3 Application of Internal Model Principle Output Regulation for Actuator Control 


In this section the internal model principle is applied to the actuator control problem where 
there exists a dead zone that limits the control authority needed to achieve some desired task. 
Recall that graphically, the dead zone can be represented by a linear piece-wise continuous 
input-output map. 


s,(v—a,)+h v2a, 
DZ(v) = h πα «νέα, 
S(vta)t+h v<-a, 


where s, and s, are the slopes of the input-output graph to the left and to the right of the dead 
zone area, respectively,-a, and a, are the dead zone left and right boundaries, respectively, 
and h is the vertical offset. The dead zone problem has been addressed by Tao and Kokotovic 
[28] using adaptive control methods to realize a dead zone inverse, denoted by DZ™'(u), to 
cancel the effect of the dead zone. When the parameters of the dead zone are known, then the 
dead zone inverse can be constructed easily. On the other hand, when the dead zone parameters 
are uncertain, the dead zone inverse can be reconstructed by an adaptive method or, in this case, 
via an internal model. The mathematical representation of the inverse dead zone 1s 


see u>h 

S, 
DZ" (u)= 0 u=h 
poe u<h 

5 


It can be shown that DZ(DZ'(u))=u and DZ''(DZ(v))=v. The adaptive method attempts to 
estimate the unknown parameters of the inverse dead zone, whereas the internal model method 
seeks to estimate the required output of the inverse dead zone. 


To understand the method first consider the simple system 
x= bx+ DZ(v) 
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where -1< b< 1 and the dead zone parameters are uncertain but are known to lie on some 
range. Now assume that the goal 1s to track the output X;cr of the reference system 


* 


Wy G 
X of τ γὴ 


If b is known and the input to the plant 15 
V = DZ (hye + Xr — ὄχ.) 


then the problem is solved for some kp=ko(b,R(Dzp)) where R(Dzp) represents the range of the 
dead zone parameters. The rate of convergence is determined by the choice of ko. Since the 
parameters are not known at any given time, this input cannot be constructed directly. Therefore, 
the desired input at any given time 1s 


ake Or u>h 

52 
v= DZ"'(u)= 0 u=h 
ial u<h 

sj 


where 
U = Kye + X,o¢ — OX, 


Given the ranges of the parameters, it is possible to choose ko for robust stability, however, 
the gains on the reference based-terms must be specified exactly. Since x,o¢ is generated by the 
dynamic reference model, an immersion is sought to bypass this requirement. The reference 
dynamics with output u 


ae 
Ww, ΞΞ --ν is immersed into 
uU=aw,+a,w, +a, 


€=4(€) 
μ τ: γ(ζ) 


The theory of system immersion is outlined in [28] and will not be repeated here. The 
important point is system immersion allows for the realization of a controller whose output is a 
copy of the steady-state input needed to regulate the stabilized system despite bounded 
uncertainty in the plant parameters. 


The controller that is used is 
4 = δ᾿ + ne 
b2 -- ἰῷ +n, 


δ. τ os + n,€ 
Gi ΚΕ, + Le 
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where the parameters {n),n2,n3,K,L,M,I',} are chosen to robustly stabilize the interconnection, 
yielding satisfactory tracking for small and large inputs. The small inputs are more affected by 
the dead zone than the larger inputs as will be shown. 


x = bx + DZ(v) 

$1 = ξ) tme ts 

ζ, = ξ, he μή Bik 

&,=-&, +n,e ic eee: 
οὐ, Δ, e€=x-w, 


€, =Ké,+Le 
v=T,6,+ Me, tke 


s,(v—a,)+h v2a, 
DZ(v) = h —a<v<a, 
s(vta)+h vs~-a, 


where —1<5<1, the dead zone parameters satisfy the constraints s,,s, Ε [1,5], he [-1,]], 
a,,a, €[0.1,1]. The single state of the plant was rate limited (+100 units) and position-limited 
(+20 units). 
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The internal-model controller formulation is now applied to the Matlab MMT Model shown 
in Figure 5-7. The highlighted block is the internal model, and comparison is done for cases 
where the commanded signal is fed directly into the actuator and where the commanded signal is 
fed into the internal and the output of the internal model drives the actuator. 
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Figure 5-7. Matlab MMT Mode! with Internal Model-Based Controller 


The nominal system (no internal model) suffers from limited response for “small’ inputs due 
to the piecewise linear dynamics in the actuator. It will be shown that the internal model greatly 
improves the response in these cases as well as maintains satisfactory response for larger inputs. 
The problem is approached in the same way that the previous problem was formulated and 
solved. The form of the IMPNOR scheme is the same with different regulating gains. However, 


in this case, no stabilizing controller was necessary due to the strong stability properties of the 
actuator oo 
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Figure: 5-9. Plots Showing Tracking for ΘΗΝ Reference Input without IMPNOR Scheme (left) 
and with IMPNOR Scheme (right) 


The spike-like behavior observed for the IMPNOR scheme can be removed by filtering the 
feedback signal x(t) without adversely affecting the overall performance. The most important 
feature is the improvement gained for ‘small’ inputs. In fact, if the amplitude of the reference 15 
reduced to 0.15 units, then the system without the IMPNOR scheme produces no output due to 
the dead zone. On the other hand, when the IMPNOR scheme is in the loop, some tracking is 
recovered. See Figure 5-10. 
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Figure 5. 10. Plots schewack No ourecking for Ven Smal innate without IMPNOR Scheme (left) 
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5.4 Summary ᾿ 


The IMPNOR method is able to maintain tracking in the presence of dead zone and Coulomb 
friction. The IMPNOR scheme realizes a controller that generates a copy of the required inverse 
dead zone dynamics needed to cause the tracking error to asymptotically approach zero. The 
main benefit lies in the ability to track small amplitude signals in spite of the dead zone. 
Therefore, the overall performance of the baseline system is improved noticeably when the 
internal model 15 added. 
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6.0 DEPARTURE ANALYSIS TOOL FOR UCAVS 


Extensive investigations of robustness analysis for linear systems were done during the 
1980’s, which brought together many useful results with the goal of developing a practical 
robustness analysis tool. Two early significant developments, upon which many robustness 
analysis techniques rely, may be the structured singular value analysis by Doyle et. al. [29], and 
the stability margin analysis with real parameter variations by DeGaston and Safonov [30]. The 
structured singular value analysis provides a way to predict the exact stability margin if the 
system has dimension less than four, and the lower bound of the stability margin if the system 
has dimension greater than or equal to four in the presence of uncertain complex parameters. 
The method developed by DeGaston and Safonov provides a way to predict the exact stability 
margin in the presence of uncertain real parameters. At the industry level, Wise implemented the 
latter method successfully into a numerical tool called ROBUSTR [31], and performed 
robustness analysis for various missiles and aircraft applications. 


Analysis of nonlinear systems, on the other hand, has stayed mainly at the theoretical level. 
A large body of work has been devoted to extend the linear analysis techniques to nonlinear 
systems. However, computational methods have not been developed to a level comparable to 
their linear counterparts. In predicting aircraft departure, much computation is required to 
perform nonlinear robustness analysis when linear analysis can not be used to approximate the 
highly nonlinear unsteady aerodynamic characteristics. 


Murray et. al. presented a numerical method for robustness analysis for nonlinear systems 
[10]. Murray’s method is briefly summarized below in order to establish a basis of our approach 
toward the development of a numerically efficient tool for nonlinear robustness analysis. 


6.1 Problem Formulation 


Consider the nonlinear system given by 


x = F(x,u,v,6,U,t) (6 1) 
X(t) = Xq 
where F represents a smooth nonlinear function of a state x, a set of signals u corresponding to 
external disturbances, a set of signals v corresponding to the effect of components which are not 
independent of the other signals, but exact dependence is not known, a set of parameters 6, and a 
set of nominal commands U responsible for steering the nominal system along the nominal 


trajectory. The trajectory is described by a set of coordinates Y, given by a smooth function G of 
the same variables 


Y = G(x,u,v,0,U,f) (6.2) 
The nominal trajectory Y,, is denoted by 
Y =G(x,0,0,6,,U,t) (6.3) 


where 6, are the nominal values for the parameters. 
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Given an uncertain nonlinear system given by the equation (6.1) and the nominal command 
signal U, with initial conditions satisfying 
¢,Sx,(4,) $C, PLZ, cal; 
the problem is to determine the maximum value of the norm of the error signal 


|p| =|G@,u,v,6,U,0-Y, (6.4) 


subject to the constraints 


lui, =, aT 2 atm 
vl, =z: i= 1,2,...,p 
4; < ὃ < D; p= 2. ΡΥ 


where z is a variable representing uncertainty output given by a smooth function H 
z= H(x,u,v,6,U,t). 
6.2 Necessary Conditions 


The necessary condition for a local maximum of the problem stated in the previous section is 
given by the following theorem: 


Theorem: For a dynamical system described by the equations: 
x= f(x,u,t) 


x(0) given 


αἰ ΞΕΈΞῚ; 
given a performance index of the form 


'y 
j= | L(x,u,t)dt 


fo 
and a restriction on the final state 
G(x(t)) = ς, 


if the signal up achieves an extremum of J, then there exists a vector of constants ¢ and a solution 
to the two-point boundary problem 


_(aL\ (af) 
= (Ss) 
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with boundary conditions: 


x(0) given 


ac ) 
A(t ;) ee τ ἢ 
ax(t,) 
Furthermore, if these conditions are met we will have 


oJ 


A(t; ) ~ ax(0) ᾿ 


6.3 Implementation Issue 


Although the technique discussed in [10] is intended to demonstrate a numerically efficient 
method for robustness analysis for nonlinear system, the technique requires to solve the two- 
point boundary problem as discussed above, which is normally time consuming. Boeing visited 
Dr. Murray at California Institute of Technology (Caltech) to find out whether or not further 
research that advances the technique has been done. After the discussion, we learned Dr. Murray 
and his research group at Caltech continue to investigate stability and robustness analysis 
method, however, the research direction has been changed, and any significant advancement of 
the technique of interest has not been made yet. Therefore, we decided not to investigate the 
technique any further toward the implementation of the departure analysis tool at this time. 
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subject to a system 


X=f(X,u,t), t<t<ty 


where ἢ is at least a C’ function of X. Then Dy. solving the Euler-Lagrange equation, we have 


i=-(2)>- Fy 


= (52) * (5) 


(7.3) 
with boundary conditions: 
X (to) = Xo 
Oh \t 
Mt) (axa) 
(7.4) 
Now, to solve our problem of interest, we set 
(eae 6 foo - 
τ: ΠΧ ΤΠ: a. 


where t; E[0, 7] denotes the instance at which the minimum separation occurs, 1.e., 


|X (és) || = min [ΧΩ i 
0<t<T (7.6) 
and where u(t) € R denotes a roll rate command, and X(é) ε Κ΄ represents a relative aircraft 


position. Then Eq. (7.3) yields 


ὡς δὲ 
aS (Ὁ a 
with the terminal condition 
_— XX (ts) 
X= TIF (ty Me (7.8) 
and 
Of 
Since 
_ on Seep ae_2lE sp) 
ἐμ ΠΟ “ 
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the optimal control 1° 


wo= (κόρα ee σαν 


By integrating u°(t) from t = fo to t= ἧς the optimal roll angle ° is determined as 


= fig ur(t) dt 
= | Ὡ (6) at ἢ St (ox) ἀξία 
“Ἀδὰ Π“(}}}}.᾿ (7.12) 
This roll angle can be shared between two aircraft such that 
sa sea” πε: 
[΄ῶτ [om = 0 ie 


where u(f) i = 1, 2 is the roll rate command for aircraft 1 and 2, respectively. 


To calculate the optimal roll angle 6°, the knowledge about ¢;is required a priori, which is 


not known. Therefore, we need to find ¢;as the fixed point solution of the following set of 
equations: 


Χ = f(X,u’, ἢ, X (to) = Χο 


v= (5) eee 


IX(t,)I| = win ΠΧῸ] 


oo (7.14) 
To specify the smooth function fCX, u, ἢ. we need an aircraft model. 
7.2 Simple Generic Aircraft Model 
In the earth-fixed coordinate system, a generic aircraft dynamics can be derived as: 
Y= + (heose — D)—gsiny 
i _ NG sin ᾧ 
V,.cos ¥ 
y = TAU cos ᾧ — cos 7) 
ᾧ = p+psin(y+a) (7.15) 


Here, Vir > %  % Lh, D, P,Nzs g and m denote the airspeed, heading angle, flight path 
angle, bank angle, angle of attack, thrust, drag, roll rate, normal acceleration, gravitational 
acceleration, and aircraft mass, respectively. The normal acceleration N, and the incremental roll 
angle AC are commanded to generate the escape flight path. In addition, the responses of the 
roll rate p and normal acceleration N, to their command inputs are approximated by the low order 
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equivalent systems, which are described by first and second order responses, respectively. 
That 15: 


p= τ ὦ = Pema) 
. (7.16) 


and 


᾿ : ~ 
N, + 2¢wN, + w*(N,z — Nema) = Ὁ (7.17) 
where τ, C and ὦ respectively represent a roll mode time constant, damping ratio, and short 
period frequency. Nzema and Pema are Maximum allowable normal acceleration command and roll 
rate command, respectively. The maximum allowable roll rate command is integrated over a 
certain period of time to meet the commanded roll angle. In the projected escape flight path 
calculation, the throttle setting is assumed to be constant. In other words, the thrust 7), is 
assumed to be constant throughout the projected escape flight path. In addition, the angle of 
attack o 15 calculated by solving 


_ Ι,-- Τὰ sina | 
mg | (7.18) 


where L denotes the lift. The lift L and drag D are calculated by looking up a table for each 
flight condition using a generic aircraft model. 


Nz 


Then the velocity component of the aircraft in the earth-fixed coordinated system can be 
calculated by 


= V, cosy cosy | 
y = Vcosysiny 
i ee (7.19) 


By integrating equation (7.19) with respect to time, the projected aircraft position (x(4), (a), 
z(t)) at time t can be calculated. 


7.3 Summary 


An analytic solution to the Automatic Air Collision Avoidance problem was discussed. The 
current method requires a priori knowledge of the time at minimum separation. This necessitates 
the fix-point iteration before advancing the time. Therefore it is not viable for the real-time 
operation. However, the objective of the current research is not to develop a real-time method 
(for now at least) for the air collision avoidance problem, but rather to search for the analytic 
optimal solution. This would provide an insight into what needs to be improved for some 
real-time methods based on numerical approach, particularly when the optimality of the solution 
is questionable. Of particular interest is to evaluate how the optimal escape maneuver calculated 
mathematically compares with the escape maneuver a pilot considers the best, and to analyze 
what needs to be done in order to fill the gap between them. 
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APPENDIX A 
PROJECTION OPERATOR 


Definition A.1: Subset Πα Κ΄ is convex if 


| Vx, ye CR" |=>[Axt(l-A)y=ze I], VOSASI (AL) 
Relation (A.1) states that if two points belong to a convex subset ΠῚ then all the points on the 

connecting line must also belong to the same subset IT. 

f:R" OR 


Definition A.2: Function is convex 1f 


f(Ax+(I-A)y)sAf(x)+(1-A) f(y), VOSASI (A.2) 


Relation A.2 is illustrated in Figure A-1, and states that graph of a convex function must be 
located below the straight line that connects any two corresponding function values. 


GP44275053.ppt 


Figure A-1. Convex Function 


-R’ +R 


Statement A.1: Let 70) be convex function. Then for any constant O>0 the subset 


Π; ={0e R"| 7(6)  δ' 


Proof: Let Θ., θι ΕΠ. Then f(6,)<6 and f(@,)<6.Since f(x) is convex then for any 
O0<A<I]1 


is CONVex. 


θ «ὃ «ὃ 


jaa saya Jsaplays(t-afla)s2o4(-2)6=9 
ν--..-...-- ------- 
Therefore 7. (60) <6 and, consequently, @¢ II; which completes the proof. 


Statement A.2: Let f(x): RR be a continuously differentiable convex function. Choose a 
TI, ={0E R"| f(@)Sd}cR" ' 
5 | f(@) \ Re Gell 


constant δ» Ὁ and consider the subset δ and assume 
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Gell 


that f(6 ) : a (1.€., Θ᾽ is not on the boundary of I, ). Also, let δ᾽ and assume that 


16 )Ξ , (i.e., (θ᾽ is on the boundary of Ms ). Then the following inequality takes place: 


(6° -0) Vf (8) <0 (ΑΞ) 


Τ 

ο οΓ(0 

where Vf(@) = of (9) ᾿ς of (9) e Μ΄ is the gradient vector of f evaluated at 6. 
00, 00, 


Relation (A.3) is illustrated on Figure A-2. It shows that the gradient vector evaluated at any 
boundary point of a convex set always points away from the set. 


vf) 


GP44275054 ppt 


Figure A-2. Gradient and Convex Set 


Proof: Since f(x) is convex then 
f (AO +(1-A)O) <A f(6")+(1-A) f (8) 
or equivalently: 
7{9: λ(ὁ' -8))< Χ(6}:λ{..(0")-- F(4)) 
Then for any nonzero 0<A<1: 
7{9: λ(θ" -0)\- f(8) 
A 
Taking the limit as 2 -- 0 yields relation (A.3) and completes the proof. 


< f(6°)-f(0) <d-5=0 


Suppose that @, the “true” parameter vector, belongs to a convex set I], 


MI, Ξ[θε R"| f(8)<0} un 


Introduce another convex set: 


f(@)<1} me 
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It is obvious that IT, ΠΙ. We may now define the projection operator: 


y if f(@)<0 
Proj(@,y)=iy —_, if | £(@)20ay"VF(8) <0] 
ONTO 46), spect 

vr (4) ἋΣ 

or, equivalently: 
Vf (0)(VF(8)) | j 
————— @) ,if θ)» θα γ' ν(0θ).»0 
γ , if not (A.7) 


Namely, Proj(@, y) does not alter the vector y if @ belongs to the convex set Q, defined in 
(A.4). In the set {0< <f(@)s< I}, the projection operator subtracts a vector normal to the 


boundary { f(@)=A} from y so that we get a smooth transition from the original vector field y 
for 2 =0 toa tangent to the boundary vector field for 4 =1. The projection operator concept is 
illustrated on Figure A-3. 


Proj @y)." Vf() 


Mh= (θ0{{{θ) =A} "πος σ᾽ 


* 
. 
Γ 
“ee, 
Se 
®e, 


My= (070) < 0} 
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Figure A-3. Projection Operator 


Using Statement (A.2) and inequality (A.3), we get the following important property of the 
projection operator: 
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0, if f (0) «0. 
0, if f (0) >0 and γ᾽ Vf (@) <0 
(θ᾽ -0)' (y—Proj(0,y)) =4(%" -6) Vf (0)(VF(A)) » <0 (A8) 
<0 20 
Wop ae. 


or, equivalently 


(0-0°) (Proj(@,)—y) <0 (A.9) 
Based on (A.6), we can now define the projection operator when both Y and © are matrices 
of the same dimensions, that is: Y=(y, ... yy)eR” and O=(6, ... O,)eR™ 
Proj(©,¥)=(Proj(@,y,) ..- Proj(@,, vy )) (A.10) 


Relation (A.10) states that for matrices the projection operator is defined column-wise. 


Finally, we show that the use of the projection operator (4.29) makes the trace term in the 
Lyapunov function derivative (4.28) semi-negative. 


trace sa'| Won) P+) = AQ" (Proj(Q,,¥,)-¥,) 4, $0 (A.11) 
) 


ee ———— =] Aaa 
a Proj(,Y rane ee 


In (A.11), Ὡ, and Y, denote j-th columns of Q and Y, correspondingly, while 4, 


represents the j-th non-negative element on the diagonal of A. From (A.11), we get the same 
adaptation law as specified in (4.29). 


Next, we show how to define the convex function f(-) that in turn defines the convex set 


Π;;. Both definitions are based on the known upper bound ΙΩ ; | <Q) that is imposed column- 
wise. For each column of Q, choose a positive constant δ, that represents a desired projection 


tolerance, and define convex function: 


\_ [τ _ (27 2,)-(2"") 


f, = 1(Q;) car ane” a (A.12) 
and convex set 
Mi ={2,| 7, <6) (A.13) 
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Then for each j=1, ..., m: 


ml, ={2,| f, <0} ={0,||,|<07"} 
| : : (Α.14) 
Ππ| ={2,| f, <1 ={9, |, s(Q7"} +e, 
Adaptive laws (4.29) require computation of the gradient of the convex function (A.12). 
2 
Vf(Q,)=—O, (A.15) 
= 
Finally, it should be noted that the adaptation with projection ensures the uniform 


boundedness of each column of the adaptive matrix Q forward in time: 


a, (df s(Qm™) +e, Vs jsm,V120 (A.16) 
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APPENDIX B 
ADAPTATION USING SIGMOIDAL NEURAL NETWORKS 


The universal approximation property of single-hidden-layer (SHL) neural network (NN) 
architectures states that a continuous scalar function f(x): Κ΄ — R can be approximated by a 
SHL NN with sufficiently many neurons on a compact (closed and bounded) domain D c Κ΄ to 
within any given tolerance, [46]. SHL NN estimate ofa scalar function f has the following 
form: 


ΕΣ να x+h) +e, (B.1) 


i=] 
where N is the number of neurons in the hidden layer, we R” is the vector of output weights, 
v.€ Κ΄ is the vector of inner-layer weights, and ἢ is the dimension of xe k" . Moreover, the 
vector of thresholds is denoted by be R” , c, represents the NN bias term. In (B.1), the scalar 


function o(z), called the sigmoid, models the output activity of each neuron. A commonly used 


sigmoid function is o(z) = (1 +e" )" . As seen from (B.1), the SHL NN approximation is linear 


in the output weights w, and the bias term c,, but it is nonlinear in the inner-layer weights v, 


and the thresholds ὁ, In formal terms, the SHL NN universal approximation property can be 
stated as follows: 


Ve>04N,c,€R, dw, be R", Ave Κ΄: f(x)-F (x) <e,VxeD (B.2) 
Relation (B.2) can also be written in a matrix form: 
f (x)=w' G(V" x+b) +c, (B.3) 
where the vector of sigmoids δ᾽ and the n-by-N matrix V are defined below: 
6(V7x+b)=[o(vPx+h) .. o(vixtby)] εκ" (B.A) 
VAN ... Wyler (B.5) 


In order to not discriminate between the output weights w and the bias term c,, the inner- 
layer weights v, and the thresholds b, both the n-dimensional independent variable x and the N- 


dimensional vector of sigmoids σ᾽ are augmented with 1-s. Then the SHL NN approximation 
(B.3) can be written as: 


f(x)=wa(V" x] (B.6) 
where 


e=[x" 17 eR™ (B.7) 
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pay freon (B.8) 

w=[w" oc] eR (B.9) 

6(7"x)=[o(a7x) ... o(vex) 1] eR” (B.10) 


Finally, generalizing the single-output SHL NN (B.6) to handle multi-output function 
approximations yields: 


F(x)=W'G(V" x) (B.11) 


where the output weights matrix W has the number of rows equal to the dimension of the vector- 


function F(x) that is being approximated on the compact domain Dc Κ΄. Note that while x in 
the left hand side of (B.11) represents the original independent variable, the right hand side of 
(B.11) depends on the augmented variable x defined by (B.7). In what follows and without a 
loss of generality, the SHL NN approximation (B.11) will be written in the form: 


F(x)=W" 6(V" p) (B.12) 


In (B.12), w=|x' 1 ΓΕ R™ and & represents the (N +1) — dimensional vector of 
U 


sigmoids augmented with a unity. 
T 
6(V" u)=(o(vix+h) ... o(vaxtby) 1] (B.13) 
To this end, consider the system error dynamics and approximate the matched uncertainty 


K, (χ,) using SHL NN with N, sigmoids in its inner layer. The SHL NN Universal 


Approximation Theorem [46] states that given an approximation tolerance δἰ >0, and a compact 
set X c Κ΄, there must exist an integer N, and “true” constant weight matrices W and V such 


that for all oe = ACK: 


K,(x,)=W" δ ake +&,(x,) (B.14) 
᾽ 
and 
leo (x, Se (B.15) 
Let 
K,(x,)=W" δ [77 u) (B.16) 


represent the neural approximation of the uncertainty K, ea . Then the approximation error 


becomes: 
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AK, (x,)=Ko(x,)—Ko(x,)=W7 G(V" w)-W" δ(ν7 u)-& (x, ) (B.17) 


The SHL NN weigh approximation errors are defined as: 
AW=W-W, AV=V-V (B.18) 
Using Taylor series expansion of the vector-function G(V ͵ Α) around the vector V7 u,we 
get: 
6(V"u)-6(V"u)=6'(V" Hu) AV" 1 +o(fav xl | (B.19) 
where & δ΄. μ)Ξ \d G(z)/d Z} ty and O(z’) denotes Taylor series expansion terms of order 
2. In that case: 
ΡΤ σ(» 1 u)-W" OV" u)=W" δ (5 u)-W" δ γ᾽ Hw) EW δ 7 μὴ 


-AW'G (γ᾽ u)+W" (6(V" μ)-- δ (7 u)} 


-aw? a(P" μ) τ [σ (9 μ)λντ με [αν af’) — 
= Aw" δ (V7 u)+W" o'(V7u) AV" μεν Ol fava | 
Further manipulation of the terms in (B.20) leads to: 
W"G(V" w\-W' G(V" μὴ- 
= aw’ δ ("77 u)+(W-aw) "6'(P"u)av? μεν ol lar al | (B.21) 


=AW' δ (777 w)+W7 δ΄ (Vw) AV? μ-- δινῖ GV" LAV" w+" ο[ Δ ἘΠ} 
Substituting for AV =V —V into the 3™ term in (B.21) yields: 
WT σ [77 w\-W" G(V" w)=AW" δ (V7 u)+W" &(V" w)AV" p 
-Aw' &'(V"u)(V-V) ΓΔ ἢ 
- ΔΙΨΤ [σ (ν u)-a'(V"u)V" μ) +07 δ΄ (Κ᾽ AV" 


ἘΔΙΨΤ δ΄ (Vu) γ' μι Ψ' o(|av Τρ | 
- -τ A ee I, το 


(B.22) 


Ω 


Using (B.17) the uncertainty approximation error AK, (x, ) can be written as a function that 


linearly depends on the SHL NN weights estimation errors AW and AV : 
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AK, (x,)=Ko(x,)—Ko(x,)=#" 6(V" u)-W" δ (7 u)-e,(x,) (B.23) 
= a0" (9(P" 4)- σ᾽ )Ρ" ula" o'(P"u)avnaa-als,) 


Substituting (B.23) into the error dynamics equation 


é=4,,e+B,A| (K,-K*) x+(K,-Ki) u-AK,(x,) 
= A, e+ B, A(AK! x+AK) u-AK, (x, )] 
yields: 
é= Ay e+ B, A(AK, x+ AK] u—AK,(x, )] 
= A, e+ B, A(AK? x+ AK] u)-B, A(Q-€,(x,)] (B.24) 
-B. A(AW’ (δ (7 u)-6'(V"u\V? ΑἹ +07 δ΄ [9  μ)δντ u) 
To this end, consider the following Lyapunov function candidate: 
V (e, AK,, AK,, AW, AV) =e" Pe 
+trace(AK? ΓΟ AK, A)+trace(AK/T;,' AK, A} (B.25) 
+trace(AW’ Ty AW A)+trace(AV’T;,' AV) 


In (B.25), P is the unique symmetric positive definite matrix solution of the algebraic 
Lyapunov equation: 


A’ P+PA,=-O 


In (B.25), Γ., Γ΄, Γιν. Γν are symmetric positive definite matrices of the corresponding 
dimensions. The time derivative of V along the trajectories of the error dynamics is: 


V (e, AK,, AK,, AW, AV) =-e' Qe 

+2e" PB, A(AK, x+AK?] u)} 

~2e™PB AAW" [6 (77 μ]- δ (V7 u)V" 4) 

~2e™PB AW" a (Vu) AV" ps (B.26) 
+2trace( AK? Γ᾽ ke A) +2trace{ AK] Γ' kK A) 

+2 trace( AW” ie WA)+ 2trace( AV? Γ᾿ ᾽) 


-26ῖ PB, Λ(Ω--ε(.,)) 
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Using the trace identity x’ y = trace( yx") we further get: 
V (e, AK,, AK,, AW, AV) =-e' Qe 
+2 trace AK? in Κ. +xe"PB)A) 


+2 trace [AK iy K +ue'PB, A| 


(B.27) 
+2race{ aw" (I41H-(4(77 u)-o'(V7u)P" w)e"PB |) 
+2 trace AV" υ V —pe" PB, AW? δ." μ}}} 
—2e" PB, Λ(Ω--ε(.,}} 
The following adaptive laws are chosen: 
K. =. _ Proj(K, ,-xe’ PB ᾽ 
[4 ΞΙ, Proj(K, ,-ue PB } 
(B.28) 
W = Γ Proj((o (V7 αἹ-- δ. uv" u)e" PB, 
y=. Proj( se" PB WT δ΄ ( ")| 
Substitution of (B.28) into (B.27) yields: 
V (e, AK,, AK,, AW, AV) 
=~e"Qe+2trace(AV" we" PB, (1,,-A)W7 δ΄ (71 μ)}--2ε᾽ PB, A(Q-«,(x,)] 
(B.29) 


ΠΩ μυῶν )) 


+ |Al(l2]+ 5} 


Since the sigmoid & and its derivative δ΄ are bounded functions of their arguments, an 
upper bound for the second order terms in the Taylor series expansion (B.19) can be computed. 


o(lar"al = | 


δ΄ μυῖα 


y a (V" u\AV" u 


<— Ag (0) 6} +24? Bll 


6(V"u\-6(V"u)-o (VW) AV" p 


(B.30) 
=20,5.°+ 
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Using (B.30) an upper bound for Q can be obtained. 


jo}=|aw δ΄ [ΓΤ μὴν w+" ο[ αν" ἢ (B31) 


<fam| ἘΣ 


+ |W (2.0. + 


a (Vu) AV" ws 


a (V"u)\V" uw 


The uniform constant upper bound Q_ in (B.31) is due to the local properties of the 


X 


sigmoidal function o and the projection operator used in adaptive laws (B.28). 
Similar arguments applied to (B.29) yield: 
V (e, AK,, AK,, AW, AV) 


<-A 


(B.32) 
win (Ο) 6 +2[el|? 8 


{,-- Alle, Ἐ[Δ| {Ὡς Ὁ εἰν )} 


| <c,. From (B.32) it 


a (Vu) AV! 
immediately follows that the time derivative of the Lyapunov function is negative definite 
outside of the following compact set: 


where positive constant c, 1s chosen such that || 


2NP Bl (Wn Ales HAN Qn +6") 


B.33 
Ase (Q) 


ee R": 


el 


This, in turn, implies bounded tracking of the reference signal. In addition, since the 
projection operator bounds the SHL NN estimated weighs W,V and their increments AW, AV , 


all the signals in the corresponding closed-loop system remain bounded. 
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APPENDIX C 
APPROXIMATE SOLUTION OF THE 
ALGEBRAIC LYAPUNOV EQUATION 


The Lyapunov Algebraic Equation (LAE) 1s of the form: 
0=PA+A'’P+O (C.1) 


where A is the (n-by-n) — Hurwitz matrix (all eigenvalues of A are in the left half plane), Q is the 


(n-by-n) — symmetric positive definite matrix, and P is the (n-by-n) — symmetric positive-definte 


solution matrix. The exact solution of (C.1) 1s: 


oo 


P= Jexp(4"z) Oexp( Az) dr (C.2) 


0 


Let 


f 


P(t)= fexp(A"r)Qexp(Az) dr (C.3) 


0 


Then 


PA+A'P= ffexp(4"r) Qexp( Ar) A+ A" exp(A"r) Qexp(Ar)}dr 


ἢ dyexp( A’ A 3 
= djexp(4’r) Oexp(A τὴ] dt = exp(A’t)Qexp(At)-Q=P-O 


Ἶ aT ee 


P 
The resulting equation 
P=PA+A'P+O (C.4) 


is called the Lyapunov Differential (continuous) Equation (LDE). From (C.2) and (C.3) it 
follows that 


lim 


[oo 


P(e) |=? ; (C.5) 


In other words, the LDE solution P(t) approximates, (i.e., asymptotically tends to), the 
corresponding LAE solution P , forward in time. 


Conclusion: Numerical integration of LDE (C.4) yields an approximation to the true LAE 
solution. 


Example: Using AB-2 integration method, we get 
P(t,)=P(t,)A+ A" P(t,) +0 


: ὲ ᾧ (C.6) 
P= (1,)+A1(1.5P(4,)-0.5P(4,.)}, eal 25 2% 
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